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SPLICE FORM 1 (cDNA sequence): 

1 GTGAGTCATA TGAAAGCTCC ACGCTGCTGA CCTCTGGCAA AAAGGGAGAG 
51 AACAAGGATA GGAGAGGCAG TGGGGGAAAG GTTCAAGTGC GGGTTTTCTC 
101 CTTGAACCTA GAAGATTATG GGTCAAGAGC TGTGTGCAAA GACTGTACAG 
151 CCTGGATGCA GCTGCTACCA TTGTTCAGAG GGAGGCGAGG CACACAGCTG 
201 TCGGAGGAGT CAGCCTGAGA CCACGGAGGC TGCGTTCAAG CTAACAGACC 
251 TAAAAGAAGC ATCATGTTCC ATGACTTCAT TTCACCCCAG GGGACTTCAA 
301 GCTGCCCGTG CCCAGAAGTT CAAGAGTAAA AGGCCACGGA GTAACAGTGA 
351 TTGTTTTCAG GAAGAGGATC TGAGGCAGGG TTTTCAGTGG AGGAAGAGCC 
401 TCCCTTTTGG GGCAGCCTCA TCTTACTTGA ACTTGGAGAA GCTGGGTGAA 
451 GGCTCTTATG CGACAGTTTA CAAGGGGATT AGCAGAATAA ATGGACAACT 
501 AGTGGCTTTA AAAGTCATCA GCATGAATGC AGAGGAAGGA GTCCCATTTA 
551 CAGCTATCCG AGAAGCTTCT CTCCTGAAGG GTTTGAAACA TGCCAATATT 
601 GTGCTCCTGC ATGACATAAT CCACACCAAA GAGACACTGA CATTCGTTTT 
651 TGAATACATG CACACAGACC TGGCCCAGTA TATGTCTCAG CATCCAGGAG 
701 GGCTTCATCC TCATAATGTC AGACTTTTCA TGTTTCAACT TTTGCGGGGC 
751 CTGGCGTACA TCCACCACCA ACACGTTCTT CACAGGGACC TGAAACCTCA 
801 GAACTTACTC ATCAGTCACC TGGGAGAGCT CAAACTGGCT GATTTTGGTC 
851 TTGCCCGGGC CAAGTCCATT CCCAGCCAGA CATACTCTTC AGAAGTCGTG 
901 ACCCTCTGGT ACCGGCCCCC TGATGCTTTG CTGGGAGCCA CTGAATATTC 
951 CTCTGAGCTG GACATATGGG GTGCAGGCTG CATCTTTATT GAAATGTTCC 
1001 AGGGTCAACC TTTGTTTCCT GGGGTTTCCA ACATCCTTGA ACAGCTGGAG 
1051 AAAATCTGGG AGGTGCTGGG AGTCCCTACA GAGGATACTT GGCCGGGAGT 
1101 CTCCAAGCTA CCTAACTACA ATCCAGGTAA TATTGATCTG AGCTTTTGAA 
1151 TACTCTGAGA ATTAGTAATG TAAGGAGAGC ATTGGCCACG CTAACAGGGC 
1201 GTTCTTGTAT TGTGAACTCA GCGGCAAAGA TGGGTGTAGA GGAATTTCTA 
1251 CATTCATATA TTCCCTGACT AATCTTTGTA TGAGGAAGAC ACTGAAAGAG 
1301 TAGCTGAGGT TAGACCAGTT CCCCAGCTCT GTAAAACACA AGTAGCAAGC 
1351 TGAATAGAAT TTGAAATGAC TATTACTGTG GATTCCACAT CCATTGTCAA 
1401 ATACCCAATG GCTCAAAAGA ACAACTCAAA AGATGGGCTC ACTTTTGGGC 
1451 CCCCTGACTG TCATAAGTGT ATTGATTAGT ATTGAATTGC ATATGTATAA 
1501 AAAGAAAGCT AATGCAACAG AACAGAGGTA GAGGCTCGCT AGGCCTAGGA 
1551 CATGCCAAGT AAGCTGTCTG TAGGTTATAC TTACTAAGAG TTCATTCATT 
1601 GCCTGTAAAC CTGACACTTG GTCATTGTCT CTCACACATT TCATCTTTCA 
1651 AGACTGGCTT CTGGGATCGA TTTAGAAGTG CTGGAAGTGT TATCCATGGG 
1701 GGAATTCTTT GAGAAGCTGT CGCAGGGCCA CATCAGAGGG ATCAGATTAA 
1751 GCAGTAGTCA CTTCAAGGAT GTTGAGACAG AGGGGAGGAG ACAGGCACTG 
1801 AACTACAGGA TGAAGGATCA TATTAGAAGC TGAAGAAGCA AATAAAGCCC 
1851 ATGCCAAAGC TGAGCTCTCA CTGGCAGGGT TGAAGGGGAG GTAGAAAGGT 
1901 ACAGATAACG ACAAGATTAG GGTGGATATG CTCCAAGCCA GA I I I I I CTA 
1951 GTCTTTATGG TCTTACATTG TTCCATTACT AAAAATGAAA TCTTCCCAAA 
2001 TTGTTGTCCT TA(_ I I I I I I I I I I I I I I I I I GAGATGGAGT TTTGCTCTTA 
2051 TCGCCCAGGC TGGAGTGCAG TGAGCCGAGA TTGCGCCACT GCATGTCCGC 
2101 AGTCCGACCT GGGCGACAGA GCGAGACTCC GTCTCAAAAC TAAAAAAAAA 
2151 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 
2201 AAA (SEQ ID NO:l) 

FEATURES : 

5'UTR: 1-117 
Start Codon: 118 
Stop Codon: 1147 
3'UTR: 1150 

splice form 2 (transcript sequence): 

1 ATGGGTCAAG AGCTGTGTGC AAAGACTGTA CAGCCTGGAT GCAGCTGCTA 
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51 CCATTGTTCA GAGGGAGGCG AGGCACACAG CTGTCGGAGG AGTCAGCCTG 
101 AGACCACGGA GGCTGCGTTC AAGCTAACAG ACCTAAAAGA AGCATCATGT 
151 TCCATGACTT CATTTCACCC CAGGGGACTT CAAGCTGCCC GTGCCCAGAA 
201 GTTCAAGAGT AAAAGGCCAC GGAGTAACAG TGATTGTTTT CAGGAAGAGG 
251 ATCTGAGGCA GGGTTTTCAG TGGAGGAAGA GCCTCCCTTT TGGGGCAGCC 
301 TCATCTTACT TGAACTTGGA GAAGCTGGGT GAAGGCTCTT ATGCGACAGT 
351 TTACAAGGGG ATTAGCAGAA TAAATGGACA ACTAGTGGCT TTAAAAGTCA 
401 TCAGCATGAA TGCAGAGGAA GGAGTCCCAT TTACAGCTAT CCGAGAAGCT 
451 TCTCTCCTGA AGGGTTTGAA ACATGCCAAT ATTGTGCTCC TGCATGACAT 
501 AATCCACACC AAAGAGACAC TGACATTCGT TTTTGAATAC ATGCACACAG 
551 ACCTGGCCCA GTATATGTCT CAGCATCCAG GAGGGCTTCA TCCTCATAAT 
601 GTCAGACTTT TCATGTTTCA ACTTTTGCGG GGCCTGGCGT ACATCCACCA 
651 CCAACACGTT CTTCACAGGG ACCTGAAACC TCAGAACTTA CTCATCAGTC 
701 ACCTGGGAGA GCTCAAACTG GCTGATTTTG GTCTTGCCCG GGCCAAGTCC 
751 ATTCCCAGCC AGACATACTC TTCAGAAGTC GTGACCCTCT GGTACCGGCC 
801 CCCTGATGCT TTGCTGGGAG CCACTGAATA TTCCTCTGAG CTGGACATAT 
851 GGGGTGCAGG CTGCATCTTT ATTGAAATGT TCCAGGGTCA ACCTTTGTTT 
901 CCTGGGGTTT CCAACATCCT TGAACAGCTG GAGAAAATCT GGGAGGTGCT 
951 GGGAGTCCCT ACAGAGGATA CTTGGCCGGG AGTCTCCAAG CTACCTAACT 
1001 ACAATCCAGA ATGGTTCCCA CTGCCTACGC CTCGAAGCCT TCATGTTGTC 
1051 TGGAACAGGC TGGGCAGGGT TCCTGAAGCT GAAGACCTGG CCTCCCAGAT 
1101 GCTAAAAGGC TTTCCCAGAG ACCGCGTCTC CGCCCAGGAA GCACTTGTTC 
1151 ATGATTATTT CAGCGCCCTG CCATCTCAGC TGTACCAGCT TCCTGATGAG 
1201 GA GTCTT TGT TTACAGTTTC AGGAGTGAGG CTAAAGCCAG AAATGTGTGA 
1251 CCTTTTGGCC TCCTACCAGA AAGGTCACCA CCCAGCCCAG TTTAGCAAAT 
1301 GCTGGTGA (SEQ ID NO: 4) 

FEATURES : 
Start Codon: 1 
Stop Codon: 1306 



Homologous proteins: 
TOP 10 BLAST Hits 

Score E 
SPLICE FORM 1: 

CRA 
CRA 
CRA 
CRA 
CRA 
CRA 
CRA 
CRA 
CRA 
CRA 



18000005115058 /altid 
18000005205923 /altid 
18000005100533 /altid 
151000011260745 /alti 
1000682315356 /altid= 
89000000195020 /altid 
1000682315355 /altid= 

1000682315353 /altid 

1000682315354 /altid 
1000682315352 /altid= 



=gi 
=gi 
=gi 

ST 

a 

91 
91 
91 1 



6755044 /def=ref |NP_035204. 
6912584 /def=ref |NP_036527. 
2392814 /def=gb | AAB70455.il 
i 112002201 /def=gb|AAG43234. 
5579351 /def=gb | AAD45514.il 
'7292375 /def=gb| AAF47781.il 



5579349 /def=gb 
5579343 /def=gb 
5579347 /def=gb 
5579341 /def=gb 



AAD45513.1 
AAD45510.1 
AAD45512.1 
AAD45509.1 



SPLICE FORM 2: 



CRA 
CRA 
CRA 
CRA 
CRA 
CRA 
CRA 
CRA 
CRA 
CRA 



18000005115058 /alticfcgi 
18000005205923 /alticfcgi 
18000005100533 /altid=gi 
151000011260745 /altid=gi 
1000682315356 /altid=gi|5 
1000682315355 /altid=gi 5 
1000682315354 /altid=gi 5 
1000682315353 /altid=gi 5 
1000682315352 /altid=gi 5 



6755044 /def=ref |NP_035204 
6912584 /def=ref |NP_036527 
2392814 /def=gb|AAB70455.1 
12002201 /def=gb|AAG43234 



579351 /def=gb 
579349 /def=gb 
579347 /def=gb 
579343 /def=gb 
579341 /def=gb 



AAD45514.1 
AAD45513.1 
AAD45512.1 
AAD45510.1 
AAD45509.1 



18000005045994 /al ti d=gi | 1524004 /def =emb | CAA67862 



1| PFTA. 
1| PFTA. 

(U6239. 
1| (AF1. 
(AF1524. 

(AE003. 
(AF1524. 
(AF1523. 
(AF1524. 
(AF1523. 



,1| PFTA. 
.11 PFTA. 
I (U6239. 
.11 (AF1. 

(AF1524. 

(AF1524. 

(AF1524. 

(AF1523. 

(AF1523. 
1| (X995. 



391 
390 
389 
386 
367 
367 
367 
367 
367 
367 



469 
468 
467 
465 
422 
422 
422 
422 
422 
421 



e-107 
e-107 
e-107 
e-106 
e-100 
e-100 
e-100 
e-100 
e-100 
e-100 



e-131 
e-131 
e-131 
e-130 
e-117 
e-117 
e-117 
e-117 
e-117 
e-116 
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BLAST dbEST hits: 
score E 

SPLICE FORM 1: 

gi 9806331 /dataset=dbest /taxon=960. . . 

gi 2140968 /dataset=dbest /taxon=9606 . . . 

gi 2028058 /dataset=dbest /taxon=9606 . . . 
gi 13132599 /dataset=dbest /taxon=960. . . 
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SPLICE FORM 2: 

gi 9806331 /dataset=dbest /taxon=960. . . 
gi 2140968 /dataset=dbest /taxon=9606 . . . 
gi 2028058 /dataset=dbest /taxon=9606 ... 
gi 2140870 /dataset>dbest /taxon=9606 . . . 



1334 0.0 
954 0.0 



377 e-102 
283 5e-74 



EXPRESSION INFORMATION FOR MODULATORY USE: 
SPLICE FORM 1: 

library source (from blast dbEST hits): 

gi 9806331 uterus-endometrium adenocarcinoma cell line 

gi 2140968 testis 

gi 2028058 Lung fibroblast cell line 

gi 13132599 Kidney renal cell adenocarcinoma 

Tissue Expression: 
Whole brain 

SPLICE FORM 2: 

library source (from BLAST dbEST hits): 

gi 9806331 Uterus endometrium adenocarcinoma line 

gi 2140968 Testis 

gi 2028058 Lung fibroblast 

gi 2140870 Testis 
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SPLICE FORM 1: 

1 MGQELCAKTV QPGCSCYHCS EGGEAHSCRR SQPETTEAAF KLTDLKEASC 
51 SMTSFHPRGL QAARAQKFKS KRPRSNSDCF QEEDLRQGFQ WRKSLPFGAA 
101 SSYLNLEKLG EGSYATVYKG ISRINGQLVA LKVISMNAEE GVPFTAIREA 
151 SLLKGLKHAN IVLLHDIIHT KETLTFVFEY MHTDLAQYMS QHPGGLHPHN 
201 VRLFMFQLLR GLAYIHHQHV LHRDLKPQNL LISHLGELKL ADFGLARAKS 
251 IPSQTYSSEV VTLWYRPPDA LLGATEYSSE LDIWGAGCIF IEMFQGQPLF 
301 PGVSNILEQL EKIWEVLGVP TEDTWPGVSK LPNYNPGNID LSF (SEQ ID 
NO: 2) 

SPLICE FORM 2: 

1 MGQELCAKTV QPGCSCYHCS EGGEAHSCRR SQPETTEAAF KLTDLKEASC 
51 SMTSFHPRGL QAARAQKFKS KRPRSNSDCF QEEDLRQGFQ WRKSLPFGAA 
101 SSYLNLEKLG EGSYATVYKG ISRINGQLVA LKVISMNAEE GVPFTAIREA 
151 SLLKGLKHAN IVLLHDIIHT KETLTFVFEY MHTDLAQYMS QHPGGLHPHN 
201 VRLFMFQLLR GLAYIHHQHV LHRDLKPQNL LISHLGELKL ADFGLARAKS 
251 IPSQTYSSEV VTLWYRPPDA LLGATEYSSE LDIWGAGCIF IEMFQGQPLF 
301 PGVSNILEQL EKIWEVLGVP TEDTWPGVSK LPNYNPEWFP LPTPRSLHW 
351 WNRLGRVPEA EDLASQMLKG FPRDRVSAQE ALVHDYFSAL PSQLYQLPDE 
401 ESLFTVSGVR LKPEMCDLLA SYQKGHHPAQ FSKCW (SEQ ID NO: 5) 

FEATURES : 

Functional domains and key regions: 

SPLICE FORM 1: 

[1] PDOC00005 PS00005 PKC_PHOSPHO_SITE 

Protein kinase C phosphorylation site 

Number of matches: 2 

1 27-29 SCR 

2 70-72 SKR 

[2] PDOC00006 PS00006 CK2_PHOSPHO_SITE 

Casein kinase II phosphorylation site 

Number of matches: 3 

1 31-34 SQPE 

2 75-78 SNSD 

3 279-282 SELD 

[3] PDOC00007 PS00007 TYR_PHOSPHO_SITE 
Tyrosine kinase phosphorylation site 

108-114 KLGEGSY 

[4] PDOC00008 PS00008 MYRISTYL 
N-myristoylation site 

Number of matches: 8 

1 2-7 GQELCA 

2 59-64 GLQAAR 

3 98-103 GAASSY 

4 112-117 GSYATV 

5 126-131 GQLVAL 

6 141-146 GVPFTA 

7 155-160 GLKHAN 
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8 244-249 GLARAK 
[5] PDOC00100 PS00107 PROTEIN_KINASE_ATP 

Protein kinases ATP-binding region signature 
109-132 LGEGSYATVYKGISRINGQLVALK 
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[6] PDOC00100 PS00108 PROTEIN_KINASE_ST 
Serine/Threonine protein kinases active-site signature 

220-232 VLHRDLKPQNLLI 

[7] PDOC00170 PS00191 CYTOCHROME_B5_l 

Cytochrome b5 family, heme-binding domain signature 

188-195 YMSQHPGG 



SPLICE FORM 2: 

[1] PDOC00005 PS00005 PKC_PHOSPHO_SITE 
Protein kinase c phosphorylation site 

Number of matches: 3 

1 27-29 SCR 

2 70-72 SKR 

3 343-345 TPR 

[2] PDOC00006 PS00006 CK2_PHOSPHO_SITE 
Casein kinase II phosphorylation site 

Number of matches: 4 

1 31-34 SQPE 

2 75-78 SNSD 

3 279-282 SELD 

4 377-380 SAQE 

[3] PDOC00007 PS00007 TYR_PHOSPHO_SITE 
Tyrosine kinase phosphorylation site 

108- 114 KLGEGSY 

[4] PDOC00008 PS00008 MYRISTYL 
N-myri stoylation site 

Number of matches: 8 

1 2-7 GQELCA 

2 59-64 GLQAAR 

3 98-103 GAASSY 

4 112-117 GSYATV 

5 126-131 GQLVAL 

6 141-146 GVPFTA 

7 155-160 GLKHAN 

8 244-249 GLARAK 

[5] PDOC00100 PS00107 PROTEIN_KINASE^ATP 

Protein kinases ATP-binding region signature 

109- 132 LGEGSYATVYKGISRINGQLVALK 

[6] PDOC00100 PS00108 PROTEIN_KINASE_ST 
Serine/Threonine protein kinases active-site signature 
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220-232 VLHRDLKPQNLLI 
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[7] PDOC00170 PS00191 CYTOCHROME_B5_l 

Cytochrome b5 family, heme-binding domain signature 

188-195 YMSQHPGG 



Membrane spanning structure and domains: 
(SPLICE FORMS 1 & 2) 
Helix Begin End Score Certainty 
1 283 303 0.776 Putative 



FIGURE 2E 



Docket No.: CL001098DIV II 
Serial No.: TO BE ASSIGNED 
Inventors: YAN, Chunhua et al. 
Title: ISOLATED HUMAN KINASE ... 



BLAST Alignment to Top Hit: 
SPLICE FORM 1: 

>CRA I 18000005115058 /altid=gi | 6755044 /def=ref |NP_035204.1| 
PFTAIRE 

protein kinase 1 [Mus musculus] /org=Mus musculus 
/taxon=10090 /dataset=nraa /length=469 
Length = 469 

Score = 391 bits (993), Expect = e-107 

Identities = 184/240 (76%), Positives = 208/240 (86%) 

Frame = +1 

Query: 406 

FGAASSYLNLEKLGEGSYATVYKGISRINGQLVALKVISMNAEEGVPFTAIREASLLKGL 585 

FG A SY LEKLGEGSYATVYKG S++NG+LVALKVI + EEG 
PFTAIREASLLKGL 
Sbjct: 129 

FGKADSYEKLEKLGEGSYATVYKGKSKVNGKLVALKVIRLQEEEGTPFTAIREASLLKGL 188 
Query: 586 

KHANIVLLHDIIHTKETLTFVFEYMHTDLAQYMSQHPGGLHPHNVRLFMFQLLRGLAYIH 765 

KHANIVLLHDIIHTKETLT VFEY+HTDL QYM QHPGGLHP 
NV+LF+FQLLRGL+YIH 
Sbjct: 189 

KHANIVLLHDIIHTKETLTLVFEYVHTDLCQYMEQHPGGLHPDNVKLFLFQLLRGLSYIH 248 
Query: 766 

HQHVLHRDLKPQNLLISHLGELKLADFGLARAKSIPSQTYSSEWTLWYRPPDALLGATE 945 
+++LHRDLKPQNLLIS GELKLADFGLARAKS+PS TYS+EWTLWYRPPD 

LLG+TE 
Sbjct: 249 

QRYILHRDLKPQNLLISDTGELKLADFGLARAKSVPSHTYSNEWTLWYRPPDVLLGSTE 308 
Query: 946 

YSSELDIWGAGCIFIEMFQGQPLFPGVSNILEQLEKIWEVLGVPTEDTWPGVSKLPNYNP 1125 
YS+ LD+WG GCIF+EM QG FPG+ +1 +QLE+I+ VLG P EDTWPGV 

LP++ P 
Sbjct: 309 

YSTCLDMWGVGCIFVEMIQGVAAFPGMKDIQDQLERIFLVLGTPNEDTWPGVHSLPHFKP 368 
(SEQ ID NO: 6) 



>CRA | 18000005205923 /alti d=gi | 6912584 /def=ref |NP_036527.1| 
PFTAIRE 

protein kinase 1 [Homo sapiens] /org=Homo sapiens 
/taxon=9606 /dataset=nraa /1ength=451 
Length = 451 

Score = 390 bits (990), Expect = e-107 

Identities = 184/245 (75%), Positives = 209/245 (85%) 

Frame = +1 

Query: 406 

FGAASSYLNLEKLGEGSYATVYKGISRINGQLVALKVISMNAEEGVPFTAIREASLLKGL 585 
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FG A SY LEKLGEGSYATVYKG S++NG+LVALKVI + EEG 
PFTAIREASLLKGL 
Sbjct: 111 

FGKADSYEKLEKLGEGSYATVYKGKSKVNGKLVALKVIRLQEEEGTPFTAIREASLLKGL 170 
Query: 586 

KHANIVLLHDIIHTKETLTFVFEYMHTDLAQYMSQHPGGLHPHNVRLFMFQLLRGLAYIH 765 

KHANIVLLH DII HTK ETLT VFEY+HTDL QYM +HPGGLHP 
NV+LF+FQLLRGL+YIH 
Sbjct: 171 

KHANIVLLHDIIHTKETLTLVFEYVHTDLCQYMDKHPGGLHPDNVKLFLFQLLRGLSYIH 230 
Query: 766 

HQHVLHRDLKPQNLLISHLGELKLADFGLARAKSIPSQTYSSEWTLWYRPPDALLGATE 945 
+++LHRDLKPQNLLIS GELKLADFGLARAKS+PS TYS+EWTLWYRPPD 

LLG+TE 
Sbjct: 231 

QRYILHRDLKPQNLLISDTGELKLADFGLARAKSVPSHTYSNEWTLWYRPPDVLLGSTE 290 
Query: 946 

YSSELDIWGAGCIFIEMFQGQPLFPGVSNILEQLEKIWEVLGVPTEDTWPGVSKLPNYNP 1125 
YS+ LD+WG GCIF+EM QG FPG+ +1 +QLE+I+ VLG P EDTWPGV 

LP++ P 
Sbjct: 291 

YSTCLDMWGVGCIFVEMIQGVAAFPGMKDIQDQLERIFLVLGTPNEDTWPGVHSLPHFKP 350 

Query: 1126 gnidl 1140 
L 

Sbjct: 351 ERFTL 355 (SEQIDNO:7) 



SPLICE FORM 2: 

>CRA | 18000005115058 /altid=gi | 6755044 /def=ref |NP_035204.1| 
PFTAIRE 

protein kinase 1 [Mus musculus] /org=Mus musculus 
/taxon=10090 /dataset=nraa /length=469 
Length = 469 

Score = 469 bits (1195), Expect = e-131 

Identities = 225/330 (68%), Positives = 270/330 (81%) 

Query: 97 

FGAASSYLNLEKLGEGSYATVYKGISRINGQLVALKVISMNAEEGVPFTAIREASLLKGL 156 

FG A SY LEKLGEGSYATVYKG S++NG+LVALKVI + EEG 
PFTAIREASLLKGL 
Sbjct: 129 

FGKADSYEKLEKLGEGSYATVYKGKSKVNGKLVALKVIRLQEEEGTPFTAIREASLLKGL 188 
Query: 157 

KHANIVLLHDIIHTKETLTFVFEYMHTDLAQYMSQHPGGLHPHNVRLFMFQLLRGLAYIH 216 

KHANIVLLHDIIHTKETLT VFEY+HTDL QYM QHPGGLHP 
NV+LF+FQLLRGL+YIH 
Sbjct: 189 

KHANIVLLHDIIHTKETLTLVFEYVHTDLCQYMEQHPGGLHPDNVKLFLFQLLRGLSYIH 248 
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Query: 217 

HQHVLHRDLKPQNLLISHLGELKLADFGLARAKSIPSQTYSSENA/TLWYRPPDALLGATE 276 
+++LHRDLKPQNLLIS GELKLADFGLARAKS+PS TYS+EWTLWYRPPD 

LLG+TE 
Sbjct: 249 

QRYILHRDLKPQNLLISDTGELKLADFGLARAKSVPSHTYSNEVVTLWYRPPDVLLGSTE 308 
Query: 277 

YSSELDIWGAGCIFIEMFQGQPLFPGVSNILEQLEKIWEVLGVPTEDTWPGVSKLPNYNP 336 
YS+ LD+WG GCIF+EM QG FPG+ +1 +QLE+I+ VLG P EDTWPGV 

LP++ P 
Sbjct: 309 

YSTCLDMWGVGCIFVEMIQGVAAFPGMKDIQDQLERIFLVLGTPNEDTWPGVHSLPHFKP 368 

Query: 337 

EWFPLPTPRSLHWWNRLGRVPEAEDLASQMLKGFPRDRVSAQEALVHDYFSALPSQLYQ 396 
E F + + +SL WN+L V AEDLAS++L+ P++R+SAQ AL H+YFS 

LP +L++ 
Sbjct: 369 

ERFTVYSSKSLRQAWNKLSYVNHAEDLASKLLQCSPKNRLSAQAALSHEYFSDLPPRLWE 428 

Query: 397 LPDEESLFTVSGVRLKPEMCDLLASYQKGH 426 

L D S+FTV VRL+PE + + ++ K + 
Sbjct: 429 LTDMSSIFTVPNVRLQPEAGESMRAFGKNN 458 (SEQ ID NO: 8) 



>CRA | 18000005205923 /al tid=gi | 6912584 /def=ref |NP_036527.1| 
PFTAIRE 

protein kinase 1 [Homo sapiens] /org=Homo sapiens 
/taxon=9606 /dataset=nraa /length=451 
Length = 451 

score = 468 bits (1191), Expect = e-131 

Identities = 224/330 (67%), Positives = 270/330 (80%) 

Query: 97 

FGAASSYLNLEKLGEGSYATVYKGISRINGQLVALKVISMNAEEGVPFTAIREASLLKGL 156 

FG A SY LEKLGEGSYATVYKG S++NG+LVALKVI + EEG 
PFTAIREASLLKGL 
Sbjct: 111 

FGKADSYEKLEKLGEGSYATVYKGKSKVNGKLVALKVIRLQEEEGTPFTAIREASLLKGL 170 
Query: 157 

KHANIVLLHDIIHTKETLTFVFEYMHTDLAQYMSQHPGGLHPHNVRLFMFQLLRGLAYIH 216 

KHANIVLLHDIIHTKETLT VFEY+HTDL QYM +HPGGLHP 
NV+LF+FQLLRGL+YIH 
Sbjct: 171 

KHANIVLLHDIIHTKETLTLVFEYVHTDLCQYMDKHPGGLHPDNVKLFLFQLLRGLSYIH 230 
Query: 217 

HQHVLHRDLKPQNLLISHLGELKLADFGLARAKSIPSQTYSSEWTLWYRPPDALLGATE 276 
+++LHRDLKPQNLLIS GELKLADFGLARAKS+PS TYS+EWTLWYRPPD 

LLG+TE 
Sbjct: 231 

QRYILHRDLKPQNLLISDTGELKLADFGLARAKSVPSHTYSNEWTLWYRPPDVLLGSTE 290 
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Query: 277 

YSSELDIWGAGCIFIEMFQGQPLFPGVSNILEQLEKIWEVLGVPTEDTWPGVSKLPNYNP 336 
YS+ LD+WG GCIF+EM QG FPG+ +1 +QLE+I+ VLG P EDTWPGV 

LP++ P 
. Sbjct: 291 

YSTCLDMWGVGCIFVEMIQGVAAFPGMKDIQDQLERIFLVLGTPNEDTWPGVHSLPHFKP 350 
Query: 337 

EWFPLPTPRSLHWWNRLGRVPEAEDLASQMLKGFPRDRVSAQEALVHDYFSALPSQLYQ 396 
E F L + ++L WN+L V AEDLAS++L+ P++R+SAQ AL H+YFS 

LP +L++ 
Sbjct: 351 

ERFTLYSSKNLRQAWNKLSYVNHAEDLASKLLQCSPKNRLSAQAALSHEYFSDLPPRLWE 410 

Query: 397 LPDEESLFTVSGVRLKPEMCDLLASYQKGH 426 

L D S+FTV VRL+PE + + ++ K + 
Sbjct: 411 LTDMSSIFTVPNVRLQPEAGESMRAFGKNN 440 (SEQ ID NO:9) 

>CRA| 18000005100533 /altid=gi | 2392814 /def=gb| AAB70455.il 
(U62391) 

PFTAIRE kinase [Mus musculus] /org=Mus musculus 
/taxon=10090 /dataset=nraa /length=423 
Length = 423 

Score = 467 bits (1190), Expect = e-131 

Identities = 224/330 (67%), Positives = 269/330 (80%) 

Query: 97 FGAASSYLNLEKLGEGSYATVYKGISRINGQLVALKVISMNAEEGVPFTAIREASLLKGL 156 

FG A SY LEKLGEGSYATVYKG S++NG+LVALKVI + EEG PFTAIREASLLKGL 
Sbjct: 83 FGKADSYEKLEKLGEGSYATVYKGKSKVNGKLVALKVIRLQEEEGTPFTAIREASLLKGL 142 

Query: 157 KHANIVLLHDIIHTKETLTFVFEYMHTDLAQYMSQHPGGLHPHNVRLFMFQLLRGLAYIH 216 

KHANIVLLHDIIHTKETLT VFEY+HTDL QYM +HPGGLHP NV+LF+FQLLRGL+YIH 
Sbjct: 143 KHANIVLLHDIIHTKETLTLVFEYVHTDLCQYMDKHPGGLHPDNVKLFLFQLLRGLSYIH 202 

Query: 217 HQHVLHRDLKPQNLLISHLGELKLADFGLARAKSIPSQTYSSEWTLWYRPPDALLGATE 276 

+++LHRDLKPQNLLIS GELKLADFGLARAKS+PS TYS+E\An"LWYRPPD LLG+TE 
Sbjct: 203 QRYILHRDLKPQNLLISDTGELKLADFGLARAKSVPSHTYSNENATTLWYRPPDVLLGSTE 262 

Query: 277 YSSELDIWGAGCIFIEMFQGQPLFPGVSNILEQLEKIWEVLGVPTEDTWPGVSKLPNYNP 336 

YS+ LD+WG GCIF+EM QG FPG+ +1 +QLE+I+ VLG P EDTWPGV LP++ P 
Sbjct: 263 YSTCLDMWGVGCIFVEMIQGVAAFPGMKDIQDQLERIFLVLGTPNEDTWPGVHSLPHFKP 322 

Query: 337 EWFPLPTPRSLHWWNRLGRVPEAEDLASQMLKGFPRDRVSAQEALVHDYFSALPSQLYQ 396 

E F + +SL WN+L V AEDLAS++L+ P++R+SAQ AL H+YFS LP +L++ 
Sbjct: 323 ERFTVYNSKSLRQAWNKLSYVNHAEDLASKLLQCSPKNRLSAQAALSHEYFSDLPPRLWE 382 

Query: 397 LPDEESLFTVSGVRLKPEMCDLLASYQKGH 426L D S+FTV VRL+PE + + ++ K + 
Sbjct: 383 LTDMSSIFTVPNVRLQPEAGESMRAFGKNN 412 (SEQ ID NO: 10) 
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Hmmer search results (Pfam): 
SPLICE form 1: 

Model Description 



Score 



E-value N 



PF00069 

1.6e-70 

CE00031 

0.0002 

CE00359 

0.25 1 

CE00022 

0.11 1 

CE00287 

0.00019 

CE00292 

1.9e-05 

CE00286 

1.9e-06 

CE00291 

0.00095 

CE00290 

1.4e-08 

CE00016 

3.5e-08 

CE00288 

0.0012 



Eukaryotic protein kinase domain 

CE00031 VEGFR 
1 

E00359 bone_morphogeneti c_protei n_receptor 

CE00022 MAGUK_subf ami 1 y_d 

CE00287 PTK_Eph_orphan_receptor 

CE00292 PTK_membrane_span 

E00286 PTK_EGF_receptor 
1 

CE00291 PTK_fgf_receptor 
1 

CE00290 PTK_Trk_family 

CE00016 GSK_glycogen_synthase_kinase 
1 

CE00288 PTK_lnsulin_receptor 



247.7 
14.3 
7.1 
6.0 
-59.4 
-61.0 
-76.4 
-87.0 
-100.0 
-152.6 
-196.5 



Parsed for domains: 



Model 


Domai n 


sea-f 


seq-t 




hmm-f 


hmm-t 




score 


E-value 


CE00022 


1/1 


204 


231 




126 


153 




6.0 


0.11 


CE00359 


1/1 


222 


247 




274 


299 




7.1 


0.25 


CE00031 


1/1 


204 


247 




1051 


1094 




14.3 


0.0002 


CE00290 


1/1 


106 


307 




1 


282 


[] -100.0 


1.4e-08 


CE00288 


1/1 


106 


310 




1 


269 


[] -196.5 


0.0012 


PF00069 


1/1 


103 


326 




1 


220 




247.7 


1.6e-70 


CE00286 


1/1 


103 


335 




1 


263 




-76.4 


1.9e-06 


CE00287 


1/1 


104 


335 




1 


260 




-59.4 


0.00019 


CE00016 


1/1 


1 


340 


i. 


1 


433 




-152.6 


3.5e-08 


CE00292 


1/1 


104 


341 




1 


288 




-61.0 


1.9e-05 


CE00291 


1/1 


104 


341 




1 


285 




-87.0 


0.00095 
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SPLICE FORM 2: 



Model 


Description 


Score 


E-value 


N 


PF00069 


Eukaryotic protein kinase domain 


263.6 


2.6e-75 


1 


CE00031 


CE00031 VEGFR 


14.3 


0.0002 


1 


CE00359 


E00359 bone_morphogeneti c_protei n_receptor 


7.1 


0.25 


1 


CE00022 


CE00022 MAGUK_subfamily_d 


6.0 


0.11 


1 


CE00016 


CE00016 GSK_gl ycogen_synthase_ki nase 


-42.5 


3.2e-14 


1 


CE00287 


CE00287 PTK_Eph_orphan_receptor 


-55.2 


0.0001 


1 


CE00292 


CE00292 PTK_membrane_span 


-63.9 


2.8e-05 


1 


CE00286 


E00286 PTK_EGF_receptor 


-76.4 


1.9e-06 


1 


CE00291 


CE00291 PTK_fgf_receptor 
CE00290 PTK_Trk_family 


-88.8 


0.0012 


1 


CE00290 


-94.9 


6.1e-09 


1 


CE00288 


CE00288 PTK_lnsulin_receptor 


-196.5 


0.0012 


1 



Parsed for domains: 



Model 


Domai n 


seq-f 


sea-t 


hmm-f 


hmm-t 




score 


E-value 


CE00022 


1/1 


204 


231 .. 


126 


153 




6.0 


0.11 


CE00359 


1/1 


222 


247 .. 


274 


299 




7.1 


0.25 


CE00031 


1/1 


204 


247 .. 


1051 


1094 




14.3 


0.0002 


CE00288 


1/1 


106 


310 .. 


1 


269 




-196.5 


0.0012 


CE00286 


1/1 


103 


335 .. 


1 


263 




-76.4 


1.9e-06 


CE00292 


1/1 


104 


335 .. 


1 


288 




-63.9 


2.8e-05 


CE00291 


1/1 


104 


339 .. 


1 


285 


-88.8 


0.0012 


CE00290 


1/1 


106 


354 .. 


1 


282 




-94.9 


6.1e-09 


CE00287 


1/1 


104 


367 .. 


1 


260 




-55.2 


0.0001 


PF00069 


1/1 


103 


387 .. 


1 


278 




263.6 


2.6e-75 


CE00016 


1/1 


1 


434 [. 


1 


433 




-42.5 


3.2e-14 
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1 TATAGGCCAA TGCTGTGGCT CACGCGTGTA TTCCCAGCAC TTTGGGAGGC 
51 AGGAGGATCG CTTGAGCTCA GGAATTGGAG ACAAGCCTAC GTAACATAGT 
101 GAAACCTCTG TCTGTACAAA TAATAAAAGA ATTTTCCAGG CATGGTGGCG 
151 TGCACCCCCA GTGCCAGCTA TTTGGGAGGC TGAGGTAGGA GGAATGCTTG 
201 AAGCCAGGAG TTGAAGACAA GCCTAGGCAA CATAGTGAGA CCCTGTGTCT 
251 ATAAAAAATA ATTAGCTGGT TGTCTTGGCA CAGGCCTGCA GCTAGCTACT 
301 CGGAAGACTG AGGTGGGAGG ATCACTGAGC CCAGGAGGCT GAGGCTGCAG 
351 TGAACAGTGA TCACCCAGCT GGATTCCAGC CTGGAAGACA GAGGGAGACC 
401 CTGTTTCCAA AAAAAAAAAA AAAAAAAAAT GCAAGAAAAG ACATCATAAA 
451 CTTGACCTGG GACATAACTT TTATGTGATG AAATTCACAA TCTTTTAGGA 
501 AGAAATTAGC ATTTCTGATA AAATGTATTA TAATTATATT ATTATAAATT 
551 CAAATGGAAT TAAATATTCT GAGAAACTAG CTTCTCACTC TCTCAGTTGT 
601 CAGTCAAAAC TTTAATGGTC TTTGGCCGGG TGCGGTGGCT CACGCCTGTA 
651 ATCCCAGCAC TTTGGGAGGC CGAGGCGGGT GGATCACAAG GTTAGGAGAT 
701 CGAGACCATC CTGGCTAACA CGGTGAAACC TCGTCTCTAC TAAAAATACA 
751 AAAAATTAGC CGGGTGCGGT GCCAGACGCC TGTAGTCCCA GCTGCTCAGG 
801 AGGCTGAGGC AGGAGAATGG TGTGAACCCG GGAGGCGGAG CTTGCAGTGA 
851 GCCGAGATTG CGCCACTGCA CTCCAGCCTG GGCGACAGTG CGAGACTCTG 
901 TCTCA AAAA A AAAAAAAAAA AAAAGTTGAA TGGTCTTTGA GCCAAGTAGT 
951 CTTCCTTTTC TTCTTCTTCT I I I I I I I I I I TTTTCAAAAA ATATCTCTAG 
1001 ATTGAATCTT GGAATTGGCT TAAGTCTCTT CTCTTGTGGC AATTTTGAAA 
1051 TGAAAAAATA CATGCTCATA ATTAAATTAC CTGAACATTT TAAAAAACCA 
1101 TCATGAGGTT CAAATATCAA ATATTCATAA ATATTGTTGT GATAATAGAC 
1151 ATAACTCTTA I I I I I I CCCT TAATAATGAT TGTTTATATA TCCTCCATTC 
1201 TGTCTCACTT TATGATTAGT ATATTATAGT GGCAATAATC TTAGGAATCT 
1251 AACAGAGAAA AGTGTTGCAT TTGAAGACTA CAGACTGCAA ACCAATTTAA 
1301 GCCAGATTCC TTGACATGTT GTGCTGTTAA TATAGTACTT TACATATAGT 
1351 AAACATTAAT TACATATATG TGGAAGGAAG CAAGCAAGAA AGGAAGAAAG 
1401 TATTTCATTC AAACTCCTCT CTCTCCATCA CCATTGGCTA ATATCATCAT 
1451 TTGTACAGTT AAGAACAACA TAGGTGCTCA CCACATAGTT TTTGAATAAA 
1501 TGAATGAATG GCAACCCTTC TAAGACTATT GGATACACTA TTGTTTGAAG 
1551 GCAAAGAGAT GCAGTAGATA TTTTCAACTT TTTTCCTGTT TTATGATTCT 
1601 GTGGTTTCTT TGACTACTAA AAGTTAGCTA GGTAGCAAAT TTGTTTTAAA 
1651 GTCTGAAAAC CAAAATGCTT TCAGATAAAA GGTAGGGAGA AAAATACTCC 
1701 TCAACATGTC CACTTTAGCA CCAGGAAAAC CTAATATCAA TATCACCATC 
1751 AATGATATCA TATAAATATC ATTGCATAGA TAAGCAATGT CAATCCCTAA 
1801 AAACTATGTA TACC AATAGC ACTAACTTGT GGCCAGAACA AGAACCTTAA 
1851 CTGTGCCAAA TTTTATTCTA TTCAATAACA GCTGCCTCGT TTTCAGTTGT 
1901 GCACATCTGA ATGCAAGCAA TCCCTGTCTG ATGTGGAGTT TCTTGCACTG 
1951 ATAAGGAAAA ACTGCTGAAG TTGTGAGGCT GCTCCAGGCA GAGCCATCAT 
2001 GTGAGTCATA TGAAAGCTCC ACGCTGCTGA CCTCTGGCAA AAAGGGAGAG 
2051 AACAAGGATA GGAGAGGCAG TGGGGGAAAG GTTCAAGTGC GGGTTTTCTC 
2101 CTTGAACCTA CAAGATTATG GGTCAAGAGC TGTGTGCAAA GACTGTACAG 
2151 CCTGGATGCA GCTGCTACCA TTGTTCAGAG GGAGGCGAGG CACACAGCTG 
2201 TCGGAGGAGT CAGCCTGAGA CCACGGAGGC TGCGTTCAAG GTATTTGTAT 
2251 CCC AGGAG AG AGCATCTTTC TCTATTGATA AACCAAGGAG TTCAGACACT 
2301 CCCTTTTTGT AGCGGGATCT GATTCTTCTG CGGTAGGTCT AAACCAATAA 
2351 AATGAAAATT CTATTAAAGT CACAGAAAAT TTATGGCTGT AGTTATCAAA 
2401 TTTGGGGAAT TTCTTGTAAA CCAAAAGGGA AAAATAATCC TTGGCTTTGG 
2451 GCTGCACGAA ACTCACTTGG CTTGAAGTCG AGAAAGTAGT TCTCTCAAAA 
2501 TCTCTAAGGT CCTAAATTAC AGAGCTGAAA CTTAAAAGGC AAGCTGCAGT 
2551 ATTAGTTGGT ATGCTATGGA TTTGAAACTT TAGTAATTAG TTCATGATTA 
2601 TTAGCAATGC CATAGATTAT TCCCCTACAG CAATAAATTA AGTGGACATG 
2651 AAAAAAAAAA GCCAGACTTA AACAGAAAAA AGTTGCAAAA CATCCATCAA 
2701 AGAGATTTAG GTTAACCTGA ATGTTAAAGA CACA I I I I I A GGTGAAGAAA 
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2751 GAATGTAGTA TTTCAGGAGT TGATACCATT ATGGTC I I I I TCAGGGATCT 
2801 TTCAAGAAAA GTGCC I I I I G GGGGTACAGG AAGCTTAGAA AACATTTGAA 
2851 GAGTGAAAAT GAGGCAAATA AAGAAAAAAT GGTTTTACCA GGCACTGAAT 
2901 CTTTACTTTG CATAAATTTT ATTTCTGCTC I I I LI I I I I I CTCTAGCTAA 
2951 CAGACCTAAA AGAAGCATCA TGTTCCATGA CTTCATTTCA CCCCAGGGGA 
3001 CTTCAAGCTG CCCGTGCCCA GAAGTTCAAG AGTAAAAGGC CACGGAGTAA 
3051 CAGTGATTGT TTTCAGGAAG AGGATCTGAG GCAGGGTTTT CAGTGGGTGA 
3101 GTGAGCAGCT GATGTTGATC AAGAAGAATT TAATGTGAGC TTGTCTACGG 
3151 AGGCCGGCCC TTGCTTCCAG GGCAATTACT GAGCGAGCCT TCCCAAGTCT 
3201 GCTCTGGCAA TGCTGTCTAA TTTCCCTGGG GAAAAAAAGT CAACACTAAA 
3251 AAAAAGTGTT CTTTCTCTCT TCCCTTTCAC CCGCTCCTTT TCCCCATTCC 
3301 CCTAGAGCAG AGGAAGAGCC TCCCTTTTGG GGCAGCCTCA TCTTACTTGA 
3351 ACTTGGAGAA GCTGGGTGAA GGCTCTTATG CGACAGTTTA CAAGGGGATT 
3401 AGCAGGTGAG TGACACATAG CTGGGAGAGA CTTTAGAGAT GAGAGTCCCG 
3451 CCCCCCCAAT TTCATATTAT AAAGCCAGGT GAGACATCAT AGAAGTTCAT 
3501 AGCACTCAGG ACCTGTGCAA GACACCATGG CCGACAGGGA GAGAGACATG 
3551 ATAACTTAAA CAGCCTTGAA AGAAAAACAA ACCTGCCCTG CCCTAATTAA 
3601 AATCAGCCCA CTTAAATGTT TATCAGCCTT TCCCTTCTTG CATTCAATTC 
3651 AGAGAATTCA AAGAAAATAG ACATTCTCTA CTACTGACCC AAAGAACAAT 
3701 TA TCACTC TT CAGGCCTGTG GGAGGCACAG TTGGTAAAGC GTCTCTAACA 
3751 GGI I I I I I AT ATCCCTCCGT AAATCACAAT GACAGAGTTT TGTAATGGCA 
3801 ACCTGGAATT TGCTGCTTCA TTCCTCCACC TGGCCTTTAT AGAAGAAACT 
3851 G AAGTTG GTT TCTGCAAATT ATGGTACATG CAAAAGATGA TAAATCCTAG 
3901 A I I I I I I ATA TTTGCAAAAT ACACAAAATG TCTGGAGAAT AAAAATACTG ■'. 
3951 CTTATCCAAA AGCTAAGTAC TAATTTTGGT AAACAACCAA CTTTGTTAAA 
4001 TATATGTAAA AGATC CATGA ATTCCCCTTT TAGTCAAGGT GGGAAAGTTG 
4051 GATGGTCGCT TTTTTCTTTA TGTTACTCCA ATAGAGAGAA AAGTAATGGC 
4101 TCAATAGTGG TTAAATATTA ATTTTAAAAA TATAGCTGAT CCGAGTGCAG 
4151 TGGTGTTTAC AACTACTTGA TCACAACCAG TTACAGATTT CTTTGTTCCT 
4201 TCTC CACTCC CACTGCTTCA CTTAACTGGC CAAAAACGAA AAAAGAAAAA 
4251 TTTTATATAA CTACTACAAG ACTAAATATT TATTATTTAT CTTAGTATTT 
4301 ATG CTGT TAT TAT TA I I I I I ACTTGTTAAA ACAGGATTGT AGGGGACATA 
4351 CAGTTTTATT TTATTTTATT ATTTATATAT TTATTTATTT ATTTTGGAAT 
4401 GGAATCTCTG TCACCCACGC TGGAGTGCAG TGGTGCGATG TCAGATGACT 
4451 GCAACCTCTG CCTCCTGAGT TCAAGCAACT CTCCTGCCCC TGGCCCTTTA 
4501 TACTTTCTTA ATCTGTTTTA GTCATGGTGT ACCTTAACTT TTTTCAATGC 
4551 TGAGAACATC TGCAATAAAG GACCACATTT TATTTTATTC TAAGCTTCCT 
4601 CATATCAATT TGGCCATGGT AACTGTTTTC AAGGTGGCTC GGAACGGGGG 
4651 CACCCTGGAA CATACTTGGA TACATGGGCA CCATGGACAC TTCTGATCCT 
4701 CTCTTCTGAG TTCTGACTTT GATTGTTCTG CACAGACCTT TCCAGCCCGA 
4751 AGTTT ACACA GAATTCACTT ATCTTTTCTT CTAGTTACTT TATGTTTTCT 
4801 TTTTCATTTA ACTCTTTCAT CTACTGGGAA TTTATATTGT ATATTCACAA 
4851 TCACCCCAGC TCCATTTATT AGATTTTCTT TTCTCTGATG GTTTGAAATG 
4901 CTGCCA TGAT TATATATTAG ATCTCACGAA TACTTGAAAT TCTTTCTGTT 
4951 CTAATCI I I I AAAAATCATG TTTCCTTAAT CTATCTTTTC TTATATTTGT 
5001 GCTGCATGAT TTTAATTATT GTTGCTTTAG GCTAI I I I IA GAATATATCA 
5051 AAACTCTACG TTAGAGAATT ATTGACATCT TTGCATTATT AGATTTTCTA 
5101 ATACAAATAT CCTGTAAATA TCTAATACAA CAGTCTCTGG ATGGTCACTG 
5151 TACAAGACCC TATAGAATCC CTACCCTCCA TTCCCCGGCA CACACTCAGC 
5201 TCCTCCCTGT CCTCATCTCC TTCCCCTCTC CTGCTTCAAT GACAGACTGC 
5251 TCCTGCCTCA GTCAAGGACT TTTAACTTGC TGTTCCCTCT GCCTGGAGCT 
5301 GCCTTCCACT GTTCATGCAC ACAGCTGACT CCCCCTCGCC ATCAGATTCC 
5351 TGGTTCAAGT GTTACCTTAT TTATAAAACT GTAGTCCCAG CTAGTCCAGG 
5401 GAGGCTGGAG GCAGGAGAAT CACTTGAACT TTGGAGGCAG AGGTTGCAGT 
5451 GAGCTGAGAT CGGCACCACC GCACTCCAGC CTGGGTGAGA GTGACACTGT 
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5501 CTCAAAAAAA AAAAAAAGCA TTTTCTCTTA TAAACATATT TGCCAAAAAA 
5551 CTTTTTGCAG GGTTTGGGGG AGAATTTCAC AGAACCATGT TCTGAGGAAA 
5601 ATACTTACCT CATAAAACTC TAAAACAAAA TTTCAAAGAC ATGATAAGGC 
5651 AAACAAAAGA AACTGGGGAA AAGTATATGC AAAATAGTTC AATAAAAAGG 
5701 TGGGCAAATC GGCAAATCAC AAGAAAAACA GAAAAGATCC ATAAACTTAT 
5751 GAAA AGTCA G TTTCACATAT GGTTAAAGAA ATATAAATTA AAATGCGATA 
5801 AACC I I I I I A CTTTTCAAAT AGGCCAAAAA AAAAAAGAAG ATGAAAGCGA 
5851 AAAGCCAACC CACATGATAG GGCTATGACA GAGGGACACA GGAGCCAACT 
5901 GAAAGAGCTT CCAAAGGACA AAGCTGCAAA AATATGAGCA ACCAAAAAAA 
5951 GTGGTATTAA ATTATAACCC AAAGTATAAA ATAAATATCT ATGAGTCCGT 
6001 ACTGATATAA ATAAATGATT CAATACATTA ACAAATGGGA GAGAAGAAAC 
6051 AAATCTCTCA TGCCAAATAA ATACAAATAA TTTATGTAGA TAATATACCT 
6101 TCAAAGAGGT ACAGCATAAC TCTCCACTCC TTAAGTGTGG GTCATTCATA 
6151 GTGGCATTTC TCTAAAAGTA CAGTATGAAA AAGGGGGAGA AAGAGTAACT 
6201 TTAGAGTAGA GAAACCTGAC CAACACTATC TCAGACAGGT GACTAAGGTC 
6251 AACATCAAAA GTCATAAATC ATGATGATGG TATGCACTCT I I I I I I I I I I 
6301 I I I I I I I I I I TTCTCAGATG GAGTCTCACT CTGTCGCCCA GGCTGGGGTG 
6351 CAGTGGCGCA ATCTCAGCTC ACTGCAACCT CCGGCTCCCG GGTTCAAGCG 
6401 ATTCTCCTCT C AGCCTC CTG AGTAGCTGGG ATCACAGGCG CGTGCCACCA 
6451 TACCCGGCTA A I I I I I I GTA TTTTAGTAGA GACGGGGTTT CACCATGTTG 
6501 CCCAGGCTGG TCTCAAACTC CCGAGCTCAG GCAATCCACC CACCTCAACC 
6551 TCCCAA AGTG CTAGGATTAC AGGCATGAGO. CACTGCGCCT GGCTGAGGGT 
6601 ATGCAU I I I I I I I I I I I IG AGACGGAGTC TTGCTCTGTC GCCCAGGCTG 
6651 GAGTGCAGTG GCACGATCTT GGCTCACTGC AAGCTCCGCC TCCCAGGTTC 
6701 ACGCCATTCT CCTGCCTCAG CCTCCCCAGT AGCTGGGACT ACAAGGTGCC 
6751 CCACCACCCA CACCCGGCTA ANN I I GTA I I I I I AGTAG AGACGGGGTT 
6801 TCACTGTGTT AGGCAGGATG GTCTCGATCT CCTGACCTCC TGATCCACCG 
6851 GCCTTCGCCT CCCAAAGTGC TGGGATTACA GGCGTGAGCC ACTGTGCCCG 
6901 GCCTGATGAA ATGTTAAATC TTTATTAAAT ATCGGATTGT ACAAGAATGA 
6951 ACTATAAGAG AAAAGTTACA TGGAGGAAAA AAGGTTACTA ACAATATGAT 
7001 TTTAATCCCA CTGTATTAAA AACAATGGAT TTATACCTGC ATTAAAATCT 
7051 TCTCTATTCT CAGCACTTAG CTGATATGAA TAAAATGATG AATGAGGGGA 
7101 CAGTAGGAGG AAATGAAGAG AGAGAGAATA ATGGTGTGGC CTGGGAAGAT 
7151 CAGGTAGCAC TTAGAAGCCC GCTGCAAGAA TTTGGCTTTT ATTCTAAGTA 
7201 ATGCGTGGAG ATATGGTGGC TTTTGAACAG AAAAGTGACT TGTCCTGATT 
7251 GTCATTTGAA AAGTATGCCT CCAACTACTA CTGCTGAGAG TAAATAGTAG 
7301 GAGTGCAAGT GTGCTCAGCA GGGAAACTGT TAGAAGACCA CTACAAGGCT 
7351 GGGCTTGGTG GCTCGTGCCT GTAATCCCAG CACTTTGGGA GCCTGACGTG 
7401 GGCAGATCAC CTGAGGTCAG GAGTTCGAGA CCAGCCTGGC CAAAATGGTG 
7451 AAACCCCCAT CTCTGCTAAA AATACAAAAA TTAGCCAGGT GTGGTGGGGG 
7501 TCCCCTGTAA TCCCAGCTTC TTGGGAGGCT GAGGCAGGAG AATTGCTTGA 
7551 ACCCAGGAGG TGGAGGTTGC AGTGAGCCAA GATCGTGCCA CTGTACTCCA 
7601 GCCTGGGCAA CAGAGCGAGA TTCTGTCTCA AAAAAAAAAA AAAAAAACAA 
7651 AAAA ACAA AA AAACACTACA ATAAGTCAGA TGAAAAATAA TAATAAGCTC 
7701 CAAATTTTCT ATAATGGACA TATATATATA TATCACTTTA GTAAAGAGGG 
7751 AAAA T GCTTT GGAATATATA TGTTATATAT GTATTGATAC ATGTTAAACT 
7801 TTTTATTTTG AGAAAATTAT AGATTTATAT GCTAGAATAT ATTTTGAAGT 
7851 GAAAGTGCTT TTGTTAAGCC ATCTTTGGTA TAAATTGCTG CTTTGAACCA 
7901 CCTCAATAAG TGTGTGCCCC TCAATCCCTC TCTTCTAGAA TAAATGGACA 
7951 ACTAGTGGCT TTAAAAGTCA TCAGCATGAA TGCAGAGGAA GGAGTCCCAT 
8001 T TACA GCTAT CCGAGAAGGT AAGAACAGCA GAAATGGACC CAATAGATCT 
8051 GTTTTG AGTC CTTGATTTGG TAAAAAATGT ATTGCATTGA TCCATTCAGC 
8101 ATCTA GTTTT GATTCTTCTG GAATACTATA ATTACATTTT TA I I I I I CAT 
8151 ACAAG I I I I I CAAGAAATTT ACACTGCTAT TTTATTACTT AA I I I I GAGG 
8201 AAATTGAGAT TTAAAACTAT TATATCACTT GACCAAAACT ATAAATTCAC 
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8251 TGAGCAATTA CTAATACTTT CCATGTGTTT GGCCTCATGC TAGGTGCTAA 
8301 GGCTATACCT ATATAACCTC AGAAAATTCC TATAAAAGAG AAAATATATA 
8351 ATCACACAAA TTCTTACTGG GAAATTTGCC TGAACATAAC ATGTTGTTAG 
8401 CTA GCAC TTG GAGATTCTCC AGAAGGCATG CATGTTTAGT GTTACTGCCT 
8451 GTATTTTCTC TGTGCCCTGG ACAGTACAGC AAATGGGTGA GGAACCTGGT 
8501 GTCAAATGGA CTTGGGTTTG CAGCACAGGT CCACCAATCA CTAGTGGTAT 
8551 GATGTTGGGT AGGTTACTTT AGCTATTTAT TACTCAGTTT CTTGCAGGAA 
8601 GAGGATAATA GTGGTACCTA TTTCATGGAG TTGTTATGAG TATTCAACAA 
8651 GAATATGTAT ATAAAGCACT TATCACAGAG TCAG I I I I I C AGAGTTCAAC 
8701 AAATGTTGAC CAI I I I I ATT CCATTCTTCT TTTCCTGGGT AATGTCTTAT 
8751 TTACCATCAA GATAACTAAT ACTTTATAAC ATAAACATCA AGAAGCCAAC 
8801 ATAGTGAAAT GAATCATTAA AAATATAATT TATCAACCTT TATTGCATGA 
8851 GCCATTTGAA ATAAGATGAT GATAGGATTG CTATGCATTT CAGCAAAATC 
8901 CCAGAGAAAT GGCACTTCCC TGGCCTTATT TTCTCCCACT TTTAACTACT 
8951 TATCTTCTGT TCTTTACTGA GCACATGCTA TATGCAGAGT ATGCTGCTGG 
9001 ATGCTGTGAA GGATGAGAAG AGAAACCCAT GTCTTTGTTC TATCATTTGC 
9051 AGTCTTAACA GAGCACATGA TTCAAGTTAC AAGTGTATAA AAGACATAAA 
9101 CTAAGATGAG AGCAAGTTAG TCTCAGTGTG ACTGATGGAG TCACTAGATT 
9151 TTGAACTGAG CTTGGAAGGA TAGGTTATGC AAACAAGCAT GGAAAAAGCA 
9201 ATTCAGAAAA TGAGTTTATA ACTGAATTTG ATACCLI I I I CAAAAGTCTT 
9251 TCAGAGCCCC TGAGGAATAC ATCATTTTGA ATTTAATTGG AAGGGCCAAA 
9301 TGGGCTATTG GTTTAGCCAG AGATTCATCC TGGTAGGATC AGGTGCATTC 
9351 TGGGAGAAGG CATGGTTTTA AGTGTTTAAT ATAATGGAAA CTGCATTAAC 
9401 TAATGTACTT ATTAATGGTC TCCATGAAAG GATGATCAGA TTTGGAAAGA 
9451 GATGTATGGA TAGGTTAAAG AGTATTTGTG AACGTAATAG AAATTCCCAG 
9501 GTCACCCGCA TAAGAGGAAG GTTTCCTTTG TGAGCTTGAG TTTGCCAATT 
9551 GCTTAAGATT GGCTTTGCTT AGATATTGCC CACAGCCAAG I I I I I CAGGT 
9601 TGACATTTAA CTGTAACAGT GAAACCTTTT GCCAGGTTTG CTAACAGATG 
9651 GTTCTCAGCA TGGTTCAGAA AACCTGGATC CGTTTTCTTC TGTATGCTAA 
9701 ATGTTTCTTT CATTGCATAT TTACGGAGGA ATTGCCTCTC CATCACAGGT 
9751 GTTTACAATT ACATTTAGTA GTCAACTGTG GACTTTCTTG GTTTGTTTTA 
9801 TGGACTTACC TTACCGAATG CTTTGCTCGT GTAATATTAA AAACCACAAG 
9851 AGGATTTCTG ACACATTGGA GGTTGTTAGG AATCCAATTT CCAACAATGA 
9901 ATGTTTCTTT TTACACCACT ATAAAAGCTT GGAGCCCTTG TTAAAAGAGC 
9951 CCTC TCCCC T CAAGAAGATA TGAGGCTTTA TTCGAAAACT TTGGCACTGT 
10001 CCCA I I I I I C CTGTAAGAAC TTTAAGGATG TGAGACCAGG GAGACAGGAG 
10051 GTTAAATGAG AAGGGCTGGA AGGCAAAGTA AGAACAGCTG GAGTTCATTA 
10101 GCTAAAATCC AGGGTCACTA GCTAAAAAGG CAACCGAAAG GCACGTGCAG 
10151 GAAAACTGAA CAAGTAATGC AGCCCTCTTT AAAAAGCCTT GAAGCAGGAA 
10201 TTGCTTTTCC TGAACAATTT GGCTGCCCTG ATGGTATAGC AGCCAAAGAT 
10251 TTATTAAGTA TGATTTTACT ACATATATGG TCTCTTTCTA TACAGGTAGA 
10301 ATACATGTGG CAATTTACTA GTCTGGTCAT TTGGAGTACT ATTTTCATTT 
10351 GACCTTAACA TGTGATATTA TGAAACTAGC AAAAGTATGA ACAGCACTAA 
10401 GGAACATTTT I I I I I I I I I I TTTTGAGACG AAGTTTTGCT CTTGTTGCCC 
10451 AGGCTGGAGT GCAATGGCAC AATCTTGGCT TACTGCAACC TCTGCCTTCG 
10501 GGGTTCAAGC AATTCTCCTG CCTCAGCCTC CGGAGTAGCT GGGATTACAG 
10551 GCATGTGCCA CCACACCCAG CTAATTTTGT A I I I I I AGTA GAGACAGGGT 
10601 TTCCCCATGT TGGCCAGGCT GGTCTTGAAC TCCTGACCTC AAGTGATCTG 
10651 CGTGTCTCAG CCTCCCAAGG GAAATATATC TTAATACATG TGTCAGTGCT 
10701 TTTCATACTT CTTTCAATCC TCTTAACAAT CTTTAGAGAT AGATATTATT 
10751 AATATTATTC CACTATATGG TGGTGATTCA AACCAAATCT CTCTGATTCA 
10801 AAAATTCATA GGCTTTCTAC GCACCCACTG TAGAAATATT CATTTAGCAC 
10851 CTACTATGAC CAGGTACTCT GCCGAACTGC TAGATACACA GCAATACACA 
10901 AAATAGATGT GTTCCCTACC ACCCTCATTC CTTTGCTAAT TAAGAAAAGC 
10951 AGAGGCCTTC ATAGTGCCTT GGAAATCTCT CATAATTGAC TCTAGAATTG 
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11001 TATTTTAAGT GTTGA I I I I I ACAACTAGGA GGAAATACTT TCATTTGAAT 
11051 AGGCTAATGT GTTATGTTTT TACATAGTAC AACATTTCTT AGTTTTATGA 
11101 AACTTTATAG CAATATCTTA ATATAATGTG CATTGTTTTA AATA I I I I I G 
11151 TTC AAGTG GT CAACTTTTGG TTTAAACTGA GGACTTTCAG CCTGTTAATA 
11201 GCAI I I I ICT TAGGAAGGAG TCATATAACT AATC I I I I I I GAGGACAAGG 
11251 CATATGACAT AATCTCCCCC TTCCCCTACA TAATGTATAT TTTTAAAACC 
11301 TTTATACCAA CCCTAGGAAG TAAAATGTGC TA I I I I I GTT GTAGAGATAA 
11351 AGAAATTCTA GCCTCAGAGA GGTTAGTTAA CTTGTCTGAG GTCACAGAGA 
11401 TAGTAATCAG AGTTGTTAGA ATCCATTTCT ATTCTATTTA AAATCCCTTC 
11451 TACTTTATTA TGATGAATTT GGAAATGCTT AACTAAAGTA TTTATTGTTT 
11501 AGCAACAGTA AAAATAAAAA TAGAAATCTG I I I I I ATT AT ACATTTTATA 
11551 TAAACGTTAA GGAAAATGCA GAAGAAGTAT I I I I I I AATC TTTAATTTTA 
11601 GATTCAAGGG GTACATGTCC AGGTTTGTTA CATGAGTATA TTGCATGATG 
11651 CTGAGGTATC TTGTCACCCA AATAGTGAGT ATAGTACCTG ATAGGTAGTT 
11701 TTTCAACCCG TGTCCCTCTC CCTTCCTCTC CCCTTTTGGA GTCCCTGGTG 
11751 TAGTGTCTAT TATTCCCATC TTATGTCTGT GTGTTCCCAA TACCCCCAGT 
11801 TATTAGCTTT CACTTGTAAG TGAGAACATG TGGTATTTGT TTTCTGTTCC 
11851 TGGGTTAATT CACTTAGGAT AATGGCCTCC ATCTGCATCC ATGTTGCTGC 
11901 TAAGGAAATG Gl I I I I I I I I I I I I I I I I I I TTGTGGCTGC ATAGTGTTTT 
11951 ATGGTGCCAG TGTACAAATT TTCTTTATCC AATCCACCAT TGCTGGGCAC 
12001 CTAGGTTGAG TCCATGTCTT TGCTATTGTG AATAGTGCTG TGACGAACAT 
12051 AAAAGTCTAG GTGTL I I I I I GACAGAACGA TTTATTTTCC TTTGGGTATA 
12101 TACCCAGGAA TGGAATTGCT GGGTCAAATG GTAATTCTGT TTTTGGTTTT 
12151 TTTGAGGCAG GAGATGGGAC TCGACTCCAG AGATGGGGCT TGAACACTAA 
12201 ACCAAATTTA GGACTAGCCA AAACAGGGCC TGGGGGGAGG CAGCTTTCCA 
12251 GAAGACACAC CCACCAGTGT GCCATGTCAG TTTACCATTG CCATGGCAAC 
12301 ACCTGAAAGT TACCACCCTT TCCCGTAGCA ACAACCTGAC AACCTGGAAT 
12351 TACCACTCTT TTCCTAAAAC TTTCTGCATA AACTGCCCCT TAATTTGCAT 
12401 ATAACTAAAA GTGGGTATAA ATATAACTGT AGAGCTACCT ATGAGCTGCT 
12451 ACTCTGGGCA CACTGCCTAT GTGGCAGCCC TGCTCTGCAA GGAGAGGTAC 
12501 ACCCGCTGCT GCTGAACACT GCTGCTTCAA TAAAAGCTGC TGTCTAACAC 
12551 CACAGGCTCA CCCTTGAATT CTTTCCTGGG TGAAGCCAAG AACCCTCCCA 
12601 GGCTAAGCCC CAGTTTTGGG ACTTGCCTGC CCTGCCTCAC TTTGAGAAAT 
12651 TTCTAAACTG TTTTCCACAG TGGCTGAACT AATTAACATT CCCACCCACA 
12701 GTGTATAAGC ACTCCCTTTT CTTCTCAAGC TTACCAGCAT CCATTAACTT 
12751 TTTACTTCTA AATAATAGCC I I I I I GACTG GTGTGAGATG GTATCTCATT 
12801 GAGGTTTTGA TTTGCATTTC TCTGATGATT CGTGATGTTG AGCAA I I I I I 
12851 TCATATGTTT GTTGGCCACT TGTGTGTCCA AAAGAAATAT TTTAAAGAAA 
12901 ATAATACATC ATGTTGTATA TTCATCAATT CTGATTCTAT CATTGATTCT 
12951 ACAGTGCCGG TAATTGCAGT GTTTAAATTA GAAACAGTCT CAGCTAAGAA 
13001 TCTTTTAAGA TCATTCTCTA GTAGAAAAAC ATTACAAAGT AATGATTCCC 
13051 AATCCATATA TGAGAAAACT GAGCCAAAAA TAGGCTAAGG AGCCTCCCTA 
13101 AGGTCATACA ATGAGGCAGG GGAGGAGGCT GATTAGAACT TCTGAATTGC 
13151 CAATGACCAC AAATAGTCTA GGGTAGGCCT GGTTGACAGA AAGTCTGCCA 
13201 TTGAACACCA TCATATCACA TGACAAATAC AGCAAATTCA TTGTGCATAG 
13251 TTACGTCTTT ATAAAACAAA ATAATGCCAG GATAATGGTA TGTGATCAGC 
13301 ATTACAATTC CAAAGATACC AAGACAACTA CTTATCTGAC ACTTGTCTTA 
13351 GTATTTCTCT AACATTTATC TAAAATTATT TCAATTATTT CTTTTCTCGG 
13401 AATGCATAAC TTGACTCATT GACTTGATTT ATGATTCTCA GATCAAAGGA 
13451 AATGTAACAA CAGGGACTAG AAACAL I I I I TTATTCAATG TCCAATGAGG 
13501 GTTGGGGAGG ACTCCATCAT TGACTCATTA TATAATTCCT CATAAACTCA 
13551 TTACAATTGG CCTGGCTTTC ATTAATTCAT GAGCACTTAT TGAGCACCAC 
13601 ATGCCAGGCC TGTGCT AGTG CTGGAGATGC AAAGACAAGG GCAAGTTCAA 
13651 TCCATGCCCT CAATGAGTTT ACAGCCTAAA GACGACTTTG ACTACCAGGC 
13701 CTTCATTACA TAGAGCGACA TCCTAGGACT TGGAGAATCA GCTTTCCTCT 
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13751 GGAGCCTTAA AGACATCCCT ATTTACTTTT GTGTC I I I I C TTTGAAGAAA 
13801 AACAAAAATA AGTATACATA GGATACATTA ATAATAAAAA AACAGTATTT 
13851 TATGAGACTC AGAATGCTAA TTTTAGGATC TTTGCCCTTC TCAGTTGACT 
13901 TTTGTGTCCC TCAACTGTTT AGTCTGCAGG ACAGATATCA CATCCTGCTG 
13951 TGCAGTTTAT AAAATGTCCT TAAAATTAGA AGAAAGAAAG GCCTTGTCTT 
14001 CCTGGGTTTA AGACCCACAC ATCTGAGGCT GTAGGCATTT CAGATCCCTC 
14051 TGGTGGATGG ACCAAAATGA TAAACAATAC TGTGAGATAA ATGCTTTAAA 
14101 CATC ATCT GC TCTTTCATCT GAATTCCCTA TTCATTATTC GGCAACATTC 
14151 ACAGTTTTCA TATAACGATT TCAGTAGTTC TAGGGCACCA GAAAAGCAGT 
14201 ACTAGGAATG GCCATAAAGC ATAGAATATT TATAATCTAA TGAGGGAGAC 
14251 AACTAAAAGA AAGAAGGAAT AAAAGCATCT TCAACAGAAA CACCCTTTAC 
14301 CAACCAACTA GAGGTATAGA AATGATATTA GGTAATTAGT GACCACTAAT 
14351 TTAAAGATAA ATATTTATTG AGTGCCAGAC ATTGTTCCAG GCACTGAGTA 
14401 TATAGCAATA AGCAAAAAAA ACAAAACAAA ACAAAACAAA AGTGCCCACT 
14451 CTCAATGGAG TTTATATTCT CAATTGTGGA GACAGACAAT AAACAAATAT 
14501 TTATATATAA AATGTCAGAT GGTGGTGACA GGCACTATGG AAAAGAATAA 
14551 AGCAGGGCCC AGAGAGAGAG GGTAGGATGG GGTAGAGGTG GGATGGGGTG 
14601 GAGGGCTGCT GAGGTGGGAT GGAGTAGAGG GCTGCTATCT CACCTAGAAT 
14651 GGTCAAGGAA GTCTGCACCT ATATGTATCA CTTGAGCGGA GGCTCTGAAG 
14701 AAAGTGAGGG AGGATGAAGG CAGAGAGGTG AGAAGAGAGG ATTACAGGAA 
14751 AAGACATTGG CAAGTGTAAA ATCCTGGGGT GGAAATGTGT TTGCAAGTGT 
14801 GTCTAAGGAA CAGCTAGGAG GCCAGTGAGG CTAAAGCCAA GTGAGCAAAG 
14851 ATGGGAGTGT GAGGAGATGA CAGGTCACGA TGGGCAGAGC CAACAGTAGG 
14901 GTGGGCAGGA AATCGCAAGT CCTTTGAATT TACTCTGCAG GAGATGAGAG 
14951 GCCACTGGAG GGTTTGGAAC CAGGAGGCAC ATGCCCTAAC TCATTTGAGA 
15001 AGGATAGCAG TGTCTGGCTG TCCTGTGAAG AAGTGGCCAT AGGAGGAAAG 
15051 CAGGGAAGCA GGCATTTGCA ATAATTCAGC CAACATATGA TAGTGGCTTG 
15101 GTCCAGGGTG CTGGCAGAAG ATATGGCAAG GGAGGGGTTC TGGACAATTT 
15151 GGAAGGTAAT GCCAATAGAT TTGTATGTGA TAAAAAGTTG AGAGGACTTG 
15201 ACGTGTACGA GTGGTTAATC TTCATAAAAT GGATGAATGG TTAAAAAGAT 
15251 TTCCGCAAAG AAACTGTGGG TTGAAGGTAA AACTAGTAAC TCCAATGTAA 
15301 GTGAACAACA GAGAAATACA AAACAGACAT TTTTCCTACT CCTACAAAAA 
15351 CTGTAATTAT CAAGAAGACG ACATGAAGTT TATACCCAGT ATTGTTAGCA 
15401 GGAAGCCTCA TTCCAAGTAG ATA I I I I I CC TTGGCCATTT TAGCAAGTGA 
15451 GAGCATGAGG CCATCATAAT GAACAAATCA TGCCATCATG ATTTAAAAAG 
15501 AAGCATCTGG AGTTTTAGTA ATATAGTTAG GTGAGACTAA AATTATACT A 
15551 AACATAAAAT TAAAATATCT TAACAATATT CTTAGCAATT TCAGCTTTAC 
15601 CATATCCTTT TGAAATCTAA TTTTGCTATA TGCTTTGTAA CATAGGGGTG 
15651 GGGGAAAGAG AGAAATTTAT GAGATAATTT ATAAATAAAA ATACACCTAA 
15701 AGTATAAGCA TTCTCAACTG ATGGTCAGAA AATATGGAAG GTATTCAAAA 
15751 CTCTAGCAGA AACATACCAT AAACAAGATT TTAAGACTGA AAGTAGACGT 
15801 TTAGTGGGGT TCAGGGTGAA AGGCAGGGGC AAGAAGCTGG CAAGAAGAGG 
15851 GAAGGGATAC TAATTCTAAT TTGCCTCTGT AATGCTTTAC ATTTACCAAG 
15901 GTTCCACAAA TGGTATCTGA TTCCATCCTC ATATCAACCC TATGAAGTAA 
15951 GTCAGAAAAG ACGATGTCTC TTTTCCTAAG GAATGAATTG AGACTTAGGT 
16001 TGAGATACTC TCCAGAGCTT ACTCAGATAG GAAGTGACAG GGCCAGGATT 
16051 CATATTAGGG CTTCTGGCTC CACAGACAGT TCTCCTTAAG ACTTTCAATA 
16101 AATATGTTTG ACAAATTAAG TGCTTACTCT CGGCTGAGTG TGGTACTAGG 
16151 TGGTGTGGCA GCATCTCAAA AAGGGGGAAA GTCACTCCCT CAATTCCCAT 
16201 GTGGCCTTCA GTCTGAGACT AGGGAGATTA AACAGATGCC TGAGAAGCTG 
16251 TTTATTACAT TTACAAAGCA ACACATTTGT CAAAGTGAAA TAATAAATTT 
16301 AGCCCATAAG GACTCTGGGG GCAAAAAGTA AAAATTAAGG CATTAGTCAT 
16351 TACAGCAAAT AAGGTTAACA GGTGTGATGG AGCTCCTTCG GCGTAAGTCA 
16401 GCTTAAATTG ACAAGTAAAG AGAGAAATTC ACTGGCTCAC AGATCTGATA 
16451 ACTACAGGCT GGTAGGGCAT AAGCAATATC ATCAGGAAGC CGTGTCTCTC 
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16501 ATTACCCAAC ACTGGTTTGC TGTGCATTCA TTTTATTCCC AGGCATGTTG 
16551 TCACCAGGTG TTGGTAATCT GACCCCAGCA ACTCCTGGCT AAATCCCACA 
16601 GGTTTAGCTC TCACAATAGA AAAGAAAGCA CI I CI I I I CT AATGGCACCA 
16651 GCAAAACAGG GTCTGCCAAA CTTGGGTTTT GTGCCTGTCT CTGAACCAAT 
16701 CACTAGGGTA TAGGGGAGTG CCGTGCTCTG ATGGCCAGCC CTGGGTCATA 
16751 TGCCCATTCT TGGGTAGAGG CCGGGTCAGT TCCACCAGAT GAGCATGGTC 
16801 TGAGGAAGAA GACGGTTGTT TTTCCAGGGG AAAATAGAAG TGCCCCCGCT 
16851 AGAAGGGAGA ATGGCTGTCA GGAGGGCAAA ACGACAGATT CACTAAAATA 
16901 GGTTGA TGCC TAAAGAAAAT AATTTTATTC CTAAATTTAA GGGAGTATTT 
16951 CAGTTGTTTT TAATCTTATG GAATTCTACA CTGGGAGGGA GTTGGTGCAG 
17001 GAGATTCATG ATATGCAGGC ATAGGCTACA GAATAATGCT TTGAG I I I I I 
17051 ATCCTTTACT TTTCCTTTCC TTTAAGCTTT AAAGACACGA TTTCTTCATG 
17101 CAGGGTTGCC CTGAGGTGAG CCTCATCATC TC I I I I I I I I GAGATGGAGT 
17151 CTCGCTCTGT CACCCAGGCC AGAGTGCAGT GGTGCAATCT TGGCTCACTG 
17201 CAACCTCCAC CTCCCAGGTT CAAGTGATTC TCTTGCCTCA GCTTCCCGAG 
17251 TGGC TGGGAT TACAGGTGTG CACCACCAGG CCCCACCACG CCCGGCTAAT 
17301 TTTTGTATTT TTAGTAGAGA CGGGGTTTCA CCGCGTTGGC CAGGCTGGTC 
17351 TCAAACTCCT GACCTCAGGT. GATCCACCCA CCTCGGCCTC CCTGAGTGCT 
17401 GGG ATCA CAA GCATGCGCTA CCACGCCCGG CCTCATGGTC TCTTTATTGT 
17451 ACCTTTTCTA GTCTCTGCTT TCCTGAAGCC AGAGGTCTTC CTATCTCCAG 
17501 AAGCTCCAAA GACACACTTT CAAACCCCTC CCAGTCACTT GGCCTTTTCT 
17551 GATGACTTCT TTCCTTCAAG GCTGCCTTTA GTAACCGATT ATTGAAGAGG 
17601 CAAGAGAAAG CCCTCAGGGT TCTCCACTTT CACCTCCCTG GGCTCCCCAA 
17651 GTTTGGCCGA CTCCTC I I I I CAAGTTCACA TTTTCTCCTT. TCCACAGAGG 
17701 TTTGCAACAT TACCTTTAAG AAATCATCTC CAGTCTCTAT CACGTTTCAA 
17751 CAGTTCTTTA CCCCATGCTT TTATCCCTGT CTCCCACCAA TCATATCCAC 
17801 CGGCCCTATT GACCGCTTGT GGGAGTTAGA ATTTTGGAGA CTGGTCATAT 
17851 GTCACAAAGT CCTGCTCTAG AAGGCAGAAC ACTCCATTTC CTGCTCCTCC 
17901 AAAGCCCTTT ATCTCTCCAG GCCTCTCCTC CTGTAGCTCT GAAGCTGGAT 
17951 TGATGAGATT CCCAGAGGGG AGCATTTAGT GCTCTGAGTG CTTTGATGAA 
18001 ATTGATTAGG TAAATGGAAA CATA I I I I I I GCAACCACTC TAGCCTGTAG 
18051 AAACAATAAG TTGCAATGAT TTGCCATTTT TGAAATAATG AAGGTTCTTT 
18101 GTAATTTTAA ATATTCI I I I GCCACAAGAG ATTGTTTTCC AGCAGTAAAA 
18151 TAACCAGAAT GTTTGATTTG AAATGTTGAA AAAATATATA CCGTCTGATA 
18201 TCTTTA GAGC AGCACTTTCA TTATCAATGA TGGATTTAAC ATTTTGTTTA 
18251 A I I I I I CTAG CTTCTCTCCT GAAGGGTTTG AAACATGCCA ATATTGTGCT 
18301 CCTGCATGAC ATAATCCACA CCAAAGAGAC ACTGACATTC G I I I I I GAAT 
18351 ACATGGTGAG TTGTTCGAGC ATTTTACAAC ACTTGAGAAA AATAACCTGG 
18401 TACTTGTATA ATGAATCTGT TAATATTTTA TGGCATGATA AAACTTTTAT 
18451 TATAATGTGA AAAGTATCAT GGAAATTTTC ATTATTGTGA TTAGTAGAAC 
18501 CTTATTGTTC CCACATCCAT CTTTGGTCCT GCTTCCTTAC CCATGACTTT 
18551 TGCTGTCCCT TTTCCCCTCA TCAGCAATAA TAAATGAGGA TCTTGAGTTT 
18601 ACCTTCTAAA TAAAACTTTT GCACTTATTT TTAATCTAAT TTTAATCACT 
18651 ATCTGAGCAG AATCCAACAT I I I I I CATTG ACAATAAAGG TAAAAATCAC 
18701 AAGATA TTTA AAAATTGTAT GCAAGCTTGC TAAAGAATAA CTCATGTTGT 
18751 A I I I I I GGAA GAAAAAATAT TTAAATAAGC AGAAAGAACT TATAAGGTAT 
18801 GTGTACTTGA CTTGCCTCCA AGGACACTTG GAGAGTGAAA AATTCCTGCG 
18851 TCGTTGTGTT CAGTGCCAGT CATTTAAAAT GAGCATCTCT GTGCTGAGAA 
18901 ACAGGCTTTG TTCTAAGAGC AGCCAGTTAG AAAGACACAC TGTGTTTGAC 
18951 CTTAACAGTG GGTTCTCAGA AAACCTGGTT ATATTCCTTT TGCACCTTAT 
19001 TCTTAAAATT CTGTACTTCG TGATACCTTC TGACAGTCAA GTCAATGTTC 
19051 TGCTTT AGGA TGCTATCTAA GCACCACTAA ATTCACTCAC TTCTCTTTCT 
19101 CCGCTGTTTT ATTTAGCACA CAGACCTGGC CCAGTATATG TCTCAGCATC 
19151 CAGGAGGGCT TCATCCTCAT AATGTCAGAG TGAGTACGTT AAGGGTCAGG 
19201 ACCCTCTCCT GGCTTGCCCA CAGAAGGAGA ATTCTGAAAC AGACTGTCTC 
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19251 ACAAAGCAAA GT CCTA TGAT ACTAAATAAG AGGATGGACA TCACTGATAT 
19301 TCCAGAAAAA AGTTTTGTTT TGTTTTCGTT TTTG I I I I I I TTTAAAAAGG 
19351 AAAGAAAAAA GAAAAAGAGT TGCTGAGTTG CTTCTTAAGA TATGGAGCAA 
19401 TGTTT TCTGA GCAACCTAAT GCTGTCAGTC ATGGCTACAT GCAAATGTGC 
19451 CTTTAGATGA ATAAACGAGT GAAGGAGAAT TATACTAAAA GGAAAAAAGT 
19501 AAAGCTAGGC CATCAAAAAA TAAATACCTT CTTCATATCA GATTACTGTG 
19551 GTCTAAGGTG AAGTCTGCAA TACTTGTACT AGCAGATCCT ATTATATATG 
19601 TGGCCCTAAC TCCCA I I I I I CCAGTCATTA GAATCAAAAT AATAAACTCT 
19651 TAATTAGCTA TAATTCTACA TCTGTTATAA ATTTTAGAAA CCATTTATAT 
19701 TTCATACTTT TCATTCCCTA AGGTTTTATT GGCATTAATT AATTGATTGG 
19751 CTCTTAAAAT AACCGTATGA AATTTGTATA TGATGTATTT ATTCATTTAA 
19801 CTAATATTTA TTTATGTATT CATTTATTCA TTCATTTAAG AAATATTTAT 
19851 TGAGTACTTA TTGCGTAATA AGTTCTGGGG TTTCAATAAT GAATAAGTTC 
19901 TGI I Id I AT TTTGAATGAG CTTAAAGTCC AGTAAGATAT ATGAACTTAA 
19951 ATAGGCAGTG AGGGCCAGTC TTCAAGCAAC AGCAATGCAA GATGGCAGCC 
20001 ACCATGGGCT CAGGCAATTT ATGAAAGCCA AATATACAGC CTTAAAATAG 
20051 AATGTGGACC TAAATACCCA GAAGAACTCC CCTTTGTAAG ATTTGTAACA 
20101 AAAATTAATA TGAGTAGAGT TAATAGTTCT AATGGAATGG TGAACCCAAG 
20151 AGCCATATCA GCGCTAGCAA AATGGCAGAA TTCATATATC ATCAAAGTTA 
20201 TCCTTCAAGA GCTTCAGCGC CTAATGATGT CTAAAGAAAA TGTGAAACGC . 
20251 CCTCAGCCAT CTGAAGGACA GTGTTACAGC AATTGATCAA AAAGAAAAAC 
20301 CACAGGCCCT TC CCCT TCCC CCATACTTGA TGTAAGCAGT CTTCATTTTC 
20351 CATAGTAGTA AATTTTCTAG ATACAGCTTG TAGAGCTCAA AGTACTGGAA 
20401 AGA AAGCT CC CATTCAAAGG AAATTTATCT TAAGATACTG TAAATGATAC 
20451 TAA I I I I I GT ACATTTGGAA TATATAAGTT GTTAGCCTGG CGCGGTGGCT 
20501 CACGCCTGTA ATCCCAGCCC TTTGGGAGGC CAGAGTGGGC AGATCATGAG 
20551 GTCAGGAGTT TGAGACCAGC CTAGCCAACA TGGTGAAACC CCGTCTCTAC 
20601 TAAAGATACA AAAAATTAGC CAGGTGTGGT GGCGCACACC TGTAACCCCA 
20651 GCTGCTCGAG AGAGTGAGGC AGGAGAATTG CTTGAACCCA GGAGGCAGAG 
20701 GTGCAGCGAG CAAAGATCAC ACCAATGCAC TGTAGCCTGG ATGACAGGGC 
20751 AAGACTCCAA CTCAAAAAAA AAAAAAAAAA AGAAATATGT AAGTTGTGCT 
20801 ATAACAAATA AATAGGCAGT GAGAAGCAAA GTGCTAAAGC CTATGACCAT 
20851 GGTAACTAGG AA TACT GTGG GAACACATAA TAAGGGAACC TAACCCAGTC 
20901 CTGGAAGTAA GGTTTTGGAA AGGAATGTTT GAGGACAAAG GGTTAAAGAG 
20951 AGTGAAAAAA AAAATTAAAA TACCAGTTTA GCTGTGTGGA GAATGGGATA 
21001 GGGAGCTAAC TAGAGAAATC AAATAGGAAT GTTTCATGGT ATGTTAAGGA 
21051 CCCTGG TAAG GGTGAAGACC ATTACATTAT CTGCACCATC GCGGGACTTT 
21101 I I I I I I ATGG TAATGCTTGG CAATTTAAAT AGAGGAGCAG AGAATGTAGA 
21151 CAGTTGGATT GAGTCAGAGT TGAAGTTCTG CCAGACATGT GAAAGGAAGA 
21201 GACAGGTAGG CAAGAGAGTT GAAGAGATTA TCAAGACAGA AGTTAATGTG 
21251 CTGGCCAGTG GCATCTAGTC TGAGTCTAAT CTGAGGGAAG GAAGTGAAGA 
21301 TAAGCAGCTT GCTGATAGTT ATGAAGAGAG TGGAAGGCTT CAAGGACCTA 
21351 CAGGTGTTGA TTAAATAGAA GAATGATTGG AGAAAGAATA ACTGTGAGAG 
21401 AGTGAGATTT TCAGGCTTGA GTGACTCTCA CATACCAGAC ACTGTGCTAA 
21451 ATGCTTCAAA GACATGATCC CTGCCCTCAA GGGACTTACA GCCAAAAACA 
21501 AGAGATAAGA AATACACACC AATACTATTA TAGGACACTT GTGTAGAATA 
21551 TCAAGAAAGA AATACGATCT AGTACTGTAG ATGTGCAACG GCATCAAAGA 
21601 TATCTTCTAG TTTCAAGAAG TTTCAGATCG GCCGGGCGCG GTGGCTCACG 
21651 CCTGTAATCC CAGCACTTTG GGAGGCCGAG GCGGGTGGAT CACAAGGTCA 
21701 GGAGATCAAG ACCATCCTGG TTAACACGGT GAAACCCCGT CTCTACAAAA 
21751 AATATAAAAA ATTAGCCAGG CGTGGTGGCG GGCGCCTGTA GTCCCAGCTA 
21801 CTCAGGAGGC TGAGGCAGGA GAATGGCGTG AACCCGGGAG GTAGAGTTTG 
21851 CGTGAGCCGA GATCGCGCCA CTGCGCTCCA GCCTGGGCGA CAGAGTGAGA 
21901 CTGCGTCTCA AAAAAAAAAA AAAAAAAAAA AAAGTTTCAG ATCTTAAACA 
21951 CACTGCATTT CAACAGTCTA GAATAGGAGA GCATGTTACA GGGAGAGAAA 
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22001 ATGTTTTCAG CAAAGGTACA 
22051 AGAGGACCCC AGAGTCATGG 
22101 TGCAATCTCC AGGTTCCATT 
22151 CAACCTGCAT TTTAAAAAAT 
22201 TATTAGCATG CATTACATTT 
22251 TTGCTCCAAT TGTAACTGTA 
22301 AGATACATAT ATATTTCTTA 
22351 CACTGGCCTG GTCTAACTTC 
22401 GAGAGGACAA ACATTTTGCC 
22451 TAAAAGTGAT TCTTAGTTAA 
22501 TCTAGTTCTG TTGGCCTGAA 
22551 CAAGTCTGGG GAAATATATT 
22601 GAAATGTGGA CCTTAGGCAG 
22651 ACTTTGGGAG GCCAAGTCAA 
22701 TAGCATGGGC AACATAGCAA 
22751 AGCTGAGCAT GGTGGCATAC 
22801 GTGGGTGGAT CGCTTGACAC 
22851 ACACAACTGC ACTCCAGCTT 
22901 AAGAAAGAAA TGTGGATTTT 
■22951 AAGTTTGAGT TGAGAGAAAA 
23001 TGGTCTGTAG GATATATGAA 
23051 ACTTAAGTGT TTGTTTGTTT 
23101 AGGCTAGAGT GCAATGGTGC 
23151 AGCCTCAAAT TATCTTCCCA 
23201 GTGTGAGCCA AAGGGTTTAT 
23251 TTCAGTGAGC AGACAAGTGA 
23301 AAGCCTGAGC CTCTGCATTC 
23351 TGGTCAAGAG CCAAGCACTG 
23401 TCAAGGTTGA TATTATGGTA 
23451 ATCAGTAACA AAGAAGAGTT 
23501 AAAGGATGGG GCCAAACATA 
23551 TTGTAGCTCT ATTAACAGAA 
23601 AGGAGTCCTG TTTCAGACGT 
23651 AAGGGAAATG AGCAGTAGGC 
23701 AGCTGTGAGC TCGACGTGTA 
23751 GAAGACAAGG AGTAGGCAAC 
23801 CTGATGATGA GACTTTTGCC 
23851 AGGAATGAAG GGAAGCAGCT 
23901 TACACACTGT CATGGAAGTC 
23951 TGGTGGACAG TATTACAAGC 
24001 GACTGCATTG ACCTTGTGGA 
24051 ATTTATTGTG GGTATATGGA 
24101 GCAGGTGGGA GTAGGGATGA 
24151 GGGAAGGGGC ACAGAGCATC 
24201 AAGCCTCCAT GAGAGTTCAG 
24251 TCTGGGTTTT GCATAGGCTT 
24301 GGCCACTGGT GATCAACTTA 
24351 GCCTTGGTCT TTCCAGTGAC 
24401 GCCAGCTATC AGTCAACTCT 
24451 GATTTTAGGA GTTGGCTGTC 
24501 TCACAATATC ACAGTCGTGC 
24551 TAAACAAAGA GGTGATAAAT 
24601 ATGACAGTGT GGTGAATAGA 
24651 ATGTTATATT CCCCTGACCC 
24701 CCCCCTACTC ACACAGAAAA 



GAGTAGGGAA ATAGAGGATA TGTTCAAGGA 
TTTGTTAGGG TTAGAGGAAA CACAGTGTTT 
AGTGCGTTAT GAAATCAATA TGGTGGTTAG 
GAAATAAATG GATGAGAAGA GAATAGAAAA 
TGAAAGAGCA AGTATTATTT TCTGCAACTT 
CTTATATTTT TATGTATGGA TGTGAATACC 
CTGT AGAC TG CAGTCAAAAA ATCTTTAAAG 
CTTATTTTGC AGAGGAGAAA TCCAAGATCT 
TGAGGTTATA GAACCAGCTT ATGCCATTGC 
AATTCTTTCC CACTAGTGCC ATACTGCACT 
ATACAGAATA TATTAGTGAA ACAGCATACA 
GGGTAGGTGG CTGAGAGCCT CATTTTCTAA 
GGTATGGTGG CTCACACCTA TAATTCCAGC 
GAAGATCGCT TGAACCCAAG AGTTCAAGAC 
GACCTCATCT CTACAAAAAA TTTAAAAATC 
GCCTGTAGTC CCACCTACCT GGGAAGCTAG 
AGGAGTTTGA GGCTAAGGTG AGCCATGATC 
GAGTGACAGA GGAAGACCCT GTCCCTAAAA 
ATTCCTTAGA CAGTACAGTC ATTAGTCATT 
TAATATGATC AGAAGAAATT TATATCACTG 
AGGAAATAAG AGACTAGAGT CAGGGATTCC 
GTTTTGAGAC AGAGTCTCTT TTTGTTACCC 
AGTCATGGCT CACCGCAGCC TCAAACTCCC 
GCTCGGCCTC CCAAAGTGCT GGAATTACAG 
TGATGTGGTC TGGCCTAGTG CCTCTCAAAC 
CCGGGAACCT GACTCAACAA GTCTGGGTTT 
TAACATGAGT CAAGCTGATG CAGATGGTGC 
AGCAGCAAGG ATCTAGTTAG CAATTAGTAA 
GTGACAATAA GAATGGAGAG GAATGTGAAA 
CACCTCTTGG TAATGTGAGC ATGAGGAGGG 
ACTGGTTTTG TGTTTGACTG ACGAGGAGAA 
ATAGGAGAAG AAGTTGGTTT GGAGAGAAAG 
GTTGAGGTCC CAGGTGAGAC AGGATCTCCA 
AACCTAAAAG GAAATCTGTG CTCAGAAGGG 
GATCTGAGGG TCATCAGCAC ATAGAGTTTA 
CAAA AGAGCA AATACACAAA GAGAGGAGGA 
TTTTAGGATG AGAAGAGGAA CAGGAAATGA 
TGTAGGAATG TAGAGCATCT GAAAAAAAAA 
AAGGGAAGAA GAATTTCAAG AAGGAGGGTA 
ATCAGGAATA CAGCTAAAAG TCATACTCTT 
TTTGTGAGGG ACACACTAAT AAATAAAGGA 
GGCACAAAGG AAGAGGTTAT CCAAATCAAA 
GTTCTCCAAG GTGGAGGCAT CAGTGAATGT 
CATGCCCATC CCAGGCAAGC CACCCTCCAG 
CTATCCAGAA GGTCTCTGTA CCCTAATCTT 
CATTGTGTAG GCATGATTTA TTAAACTATT 
ACCTTCAACC CCTCTCCCCT CCCTAATCAT 
CAGTCCCTAT CCTAAGCTAC CCAATGGTCT 
ACAAAAAGAC ATCACTTTGG AGATTCTAAG 
AGGAATTTAG TTGAAGATCA AATATATATT 
TATTTTATAT CAGGCGCCAT TAAATGGTTT 
TCAGATTTTC I I I I I ATAAA GCTTACACTG 
TTGGGATGAG GGCAATACTT I I I I I I I GAA 
TACTTTCTCC TTG I I I I CI I CTACCTCTCT 
CTTCTCTCCC TCTACTCATT CCCTGAATGC 
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24751 TGGTGTCTGT TAAGGTTCCA GCCTTGACAG TGAGGCTAAT CAGAACCACA 
24801 GTGGTACAGA TGTGAGATGA TGGTGGGAGA AAGTGGACAG ATATGAGACC 
24851 AATTACTTAG CCGGAACTGA CGGGAAAAAC AAGAGTCAGC GATA I I I I I I 
24901 TCTGGATCTG AGTATTAAAA TGGATGATGG TGCCATTCAC TGTGATAGAG 
24951 AATCAGAAAG AAAAATTTAT TTTGGAGAGA TACCATGAAT TGTGTTTTAG 
25001 ACATGCTAAG TTTGAGGTGA TTATGGGATG TACAGGCGAG CTCCAGACTG 
25051 TGTGGGCCTA AAGTAGAAAG GCAATCTGAG TTGGAGATAA AGATTTTGAA 
25101 ATCATCAGAA TACGGTTGTT CATTAGAGCA CTGTCAGTGG GTAAGATAGC 
25151 TAAGGGAGCA TGTGTAGAGT GATAACAGAA GATCAAAGAC GGAACCCTAA 
25201 GAAT AACAA T ATGTTATTAT TTATTATTTT ATTATGTTTT A I I I I I I AAT 
25251 TTTA I I I I I A TTTATTTATT TAI I I I I AGA CGGGAGTCTC GCTCTGCTGC 
25301 CCAGGCTGGA GTGCAGTGGC GCAAACTCAG CTCACTGCAA CCTCCGCTTC 
25351 CTGGGTTCAA GGGAGCCTCC TGCCTCAGCC TCTCAAGTAG CTGGGACTAC 
25401 AGGCACCCAC CACCTCACCT GACTAATTTT TGTA I I I I I A GTAGAGACGG 
25451 GGTTTCACCA TGTTGGCCAG GCTGGTCTTG AACTTCTGAC CTTGAGTGAT 
25501 TCACCTGCCT T GGCCTT CCA AAGTGCTGGG ATTACAGGTA TGAGCCACTG 
25551 TGCCTGGCCT A I I I I I GTTT TTTATAGAGA TGGGGTCTTG CTATGTTGCC 
25601 CAGGCTGGTC TCGAACTCCT GGACTCAAGC AATCCTCCTG CCTTGGCCTC 
-25651 TCAAAGTTCT G GGAT TACAC ATGTGAGTCC CTGCGCCTGG CCAGAATATC. 
25701 AATATATTAG ATTTTAGTAG AAGTAGAACC TATGAAAAGA ACAGCCAGAG 
25751 GGGCAGAAGA AAAATTAGGA GATTGTGGAA CCAAAAGAAG AGAGTGCCTC 
25801 AGGAAGGAAG GCATGGTCTA TGATGCCAAA TGCTGCAAAG ATAAGGAATA 
25851 AGAAGTATCC ATTGGGTTTC ATAGGAAAAG TCATGGGAAA CCATGGTAAA 
25901 AA AACATT GT GAATGACACA ATCGTTGCAA AAGCA I I I I I ATAGGGGGAT 
25951 GAATTTTGTA TTTCAGAGGA CAAACAGTTC CATACAATGG CAAGATCTAG 
26001 TGTGTGACCA CGGGAGTTAG TGTCTGAAGT GGATTGGAGA AGCAGATCAT 
26051 TGGAGCTGAG GTTGGCTAGA GCTGTTCTCA TGGACACTAA TGTCATGGAG 
26101 TCAACAGCTG TGATCCAAGT GCCCACATCT TCAGTGAATG ACAGAGAGGG 
26151 ATTGAGAGTT CAGTGAATGA CCGCTAAAAG AAGAGTAATG GAAGATGTGG 
26201 CTGGATGGCA TTAAAATCCA AGGGACAGGG G I I I I I ACTT AAAAGTAGAG 
26251 AAGTAATGGT TTTGAAGTGG TAGTGGGGAA AAGGGAGGCA GCTTATGACA 
26301 CTTGTCAGTG GTCAAAGGTA TGAGGAAGTT ATAGAAAAAC TAACATCCAC 
26351 TTGAGAATAT TATAGGGAAG CAGTGAGCTC AAGGTCTCAT TTAAGGAAAG 
26401 GAGCCAAAAG GAAATTCACC AGAGGTTAGC TTTTAGGTAG I I I I I AAAGC 
26451 AGGATTGAAG AATGGAGACT AAACAGTGAA AATGTTTGGG AGAGAGAGGA 
26501 GCAATAGATA TGAGGCTAAA CAGAGGAAGC ACAGAACAGA ATGGAGATGA 
26551 GTATGTTGGG AGGAAAAGGA ATAGTCAGAG GCTTATATTT TGAGTTGTGA 
26601 CCAAGGAAGA CAGGGTGGGA ATCCTCGTGA GGTTATCTTG TTTCAGATTT 
26651 CTAGTAGAAT GAGTCCCAGG GATTCCAGGG GGGATGGAAG GACTCAGGCT 
26701 TCCCTATAAG GAGTTGGCTA ACGGATCTCA TTGGTTTTTG AGTAACTCCT 
26751 GGCCCAGATG GCACTAGTTC AATGGAATTA TTTTGTTCCC CCAAAACTTA 
26801 TTGAGTTGGA AACAGGTCTA ACTCCTGGGA TCTGGGAAGC CTTTCTGGAA 
26851 AGAGTCACCC ACGATCTGGC TGATGTTGAA CTGTGCAGAC ACCATCATAT 
26901 TTGGTTATGT TAGGATGCAA TAATTGGTGA AGCTTCTGTA GTGTTGAATG 
26951 AAGAATCCAG GTTGGAAGGG ATGAAAGGGT GAGTGGGTGA TGAGGTTTGT 
27001 CAGCACAGAC TGCAATTTTG AGAAATGTGG TTATAAAATA CCATACCTTA 
27051 ATACCGCAGT GCTTTACCAC TCACAAATGC CTGTAGACGT ATCTGGCAGA 
27101 GAGGAAAGGG GTTGAATGGC AAGAATGTGG GAAGGGACTG TGGCTAGTTA 
27151 GTGAAAATAG TCTACACTTG GGACATAAAA GGCATTTCAA GCTGACCTAC 
27201 TA AGAA GCTC TGTCTCTGAC TCAGCCAGCT GGCTCTCTCC TTCCCTGTCA 
27251 TGTTTTCATT TTCTGTCTTT TCTCTAGTTT CTCAGGATGG TATAGTGGAG 
27301 TCAGACAAGT CTGAATTTGA GTCTTGGCTC TGACTATTCC TAGACATGTT 
27351 TTAAAAGTTA CATTGAGCCC TGGTTTTCTC TGTAAACTGA GGATAAGCAT 
27401 GCTATCCCAA AGGTTGTATC CCTCACTGGT CACCAGCTTC CTGTCTTCTA 
27451 TCCACCTGTC TTCCTCTTCC TCTTTCCCTA GTCCTGCATA TTGAAAAACA 
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27501 I I I I I I I I I I I I I I I GAGAT GGAGTCTTGC TCTGCCACCC AGGCTGGAGT 
27551 GCAGAGGCAC GATCCTGGCT CACTGCAACC TCTGCCTTCC AGGTTCAAGC 
27601 AATTCTCCTG CCTCAGCCTC CCGAGTAGCT GGGATTATAA GCATATACCA 
27651 CCACATCTGG CTAAI MUG TAI I I I IAGT AGAGATGGAG TTTCACCACA 
27701 TTGGCCAGGC TGGTCTCGAA CTCCTGACCT CAGGTGATCG GCTCGCTTTG 
27751 GCCTTCCAAA GTGCTGGGAT TATAGGCGTG GGCCACTGCG CCAGTCTGAA 
27801 AAACGTATTT TTAAGCACAT ACTATCGTAT CTTCTTGTCT TTTACCTGGA 
27851 ATTTAAGCTG GTTGTTTGTA TTACC I I I I C CATGGACATT TATATTTATA 
27901 ACCAATCAGA AGGTTTAAAT GTCAGTGTAG GAATTTTGTG CTATGGAAGC 
27951 TTCGTGGCTT GGTGAATGGT AAAATGAATA ATGTGTGTAT ATTTGAAGCA 
28001 TCAGAAAGAG AAAATGCTGG GAAGATTCAT AGAACCAGTT AACATTTGAA 
28051 CTAGGAGTCA TAAGAAATTT TTAAAATTCT TAAATGGTTT ATGAACCTGA 
28101 TGTGGTAGCT ACATGAAACC TGCATAGCTG CAGGTATGCT ATGGTAGGTA 
28151 AACTCTCCAT GCTCCTGCTT CCATTGGACC ATTTGGCTCC AATGTCTCCA 
28201 GGTCTTTGTT AGATCAATAC TGGTCCTAGC ATCTCTGAAA GTCCTAGCTT 
28251 TCTAAGATGC TGTTGAAAAA GAGGATTAAT CCACATAACT CTGCATCTGC 
28301 CATTTTGCCC ATGTCCCAGG AATGCTGGGC CTAGCCCTTC CTTTCTGAAC 
28351 TGCCAGAACA CGTTCTCAGT TGACATACGT CTTTGTAAAT ACTGATGTTG 
28401. GTGTTTGAAT TCTCAATTGC CAATGGCACT GGAAAATAGC AAAAGATACT 
28451 TGGAATACTA AGCATTCTTT TTTTCCGGTA AGTTTCTGTA GTGATGGGAA 
28501 CCTAGTAATG GCTTTGGTTT CTGTGCCTCA TAAGCACATG AAACA I I I I I 
28551 AATTTGGGGC TCAGAATGTG I I I I I CCCTT TTATTTCTGC ACCACTACCA 
28601 TTTACCCTTT CTCCCTTCTT CCTCCTACAA TTTGTTCCTT ATTCI I I I I I 
28651 GA I I I I I I I I GAGGGGGGGG GGTCTAACTT ATTTTGGTCT CTCTTCCCTT 
28701 TTCATCTGTA CTGTGTATTT CCCTTGTTTT CAACTTTGAA TTTAAGACTT 
28751 TAAAAATAGC TTTAAAAAGA TAAAGATTTC TTTATTTTCT AATACCATCT 
28801 AAAGATATAT I I I I I AGTGT GGTCTCCTTG TGTTGTGTTT TTAAAAGGGT 
28851 TTCATATTGG AGAGCCTGGA AAACTTAAGC AGTTGTAAAC TTTAGAATAT 
28901 CATTTCCAGG TCAACTTTGA TCTTATATGC CAAGTTCATC GGTGGGGAAA 
28951 AAAATTAAAT CTTTCACATC TAAATCAATA ACTAGTGTTC CAAAGGAAAC 
29001 TTCAAAGTTT CACTTTAGAT TTTTAAAGAA GGGTAATTCC TTCAGTATCA 
29051 AAGAAATGAG ATGTCAGGAA AAGCCAGAAT CCCTTTGTTT AGGACACAGT 
29101 CTAGTTACTT GAL I I I I CTT GTCC I I I I I C TTCCCCCTCT GAATGTAAAA 
29151 ATCTTCTTCT TCTTC I I I I I I I I I I I I I I I TTGGTCTCTC AAGAGACACT 
29201 TTTACTATAT TCTTTGAGAT GACTG I I I I I GATTTAGAGG CGAAATCAGC 
29251 ACGTGGTGGC TCAAATCTCC TTATGGATAG TGTTTCTTCC TTCCAGCTTT 
29301 TCATGTTTCA ACTTTTGCGG GGCCTGGCGT ACATCCACCA CCAACACGTT 
29351 CTTCACAGGG ACCTGAAACC TCAGAACTTA CTCATCAGTC ACCTGGGAGA 
29401 GCTCAAACTG GCTGATTTTG GTAAGTCGCC CCTCGGGTCT CATTCTGGGC 
29451 TGTGAACAAT GATGCTTTTG TGTGCACTTG TTTAAGCGTT GACTGGGCCT 
29501 GGCCTTTGAA AACTGGAGGC CCAAGAACAT GATGCTTTGT GAGGATATCA 
29551 AACTACCACA AAGGAAGTGT GAGGCACGAA ACAGGGAGGG ATTGGTAGCT 
29601 TTCTAGGATT CCACCAAGTC CCAGTTTAGT CAGATGGCCA AAAGCTGGGC 
29651 ACCCTTGCTG CCCCACTGCC AGTTTTGATA TAGAGACATT GGTAGAGTAA 
29701 ACTGTACTTA GTAAGTTTTC CTAAATCTAA GTGAATATAC AAATTATATT 
29751 GGAATAGATT GAGATTATCC CAAGATGATA AAGAGGTTAA CCCCAGATTG 
29801 TAGCATGGAC TCCTGTCAGG ATGGAGACTC CAGGACACTT GTTCCTGCTC 
29851 TCCTACCTTC TTTATATAAG TGTGAGATGC AAAGTTTTAT TCCCATTAAA 
29901 GTGAAGCAGA TTTCCTCTAA GTATCACTGT ATCCTTCCAT TTTAGCACTT 
29951 ATCGCAGTTT ATAATTATAT TCACACACAT AAATACATAC ATGCATACAT 
30001 ACAAATATAT ATACATGTGT GAGCACACCC CCACACACAA ATATATATAG 
30051 ATTTGCGTGA TGATTTTGTC TCAACTGGAC TGTAAGCATA ATGAGGGCAG 
30101 CCTGGGTTTG I I I I I GCTTA TCATTTTATC CTTAGTGCCT GGTACCATAG 
30151 TAGGTGCTTA ATAAGTACTT GTTGAAAAAC TGGCTCTATG TGAGCTAAGG 
30201 AACCACTCTT CTCTGTTTGG CAGATGCCAA ATGGTGATAC TATCACTGCA 
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30251 GTATTTATTC TGAGATGGCA GCTTTTATCC TGACATGTAA GCATTTAACA 
30301 GATATTTGTT TATCAATTCT CCACAATAGC AAACTCATCT ATTGAAGTTT 
30351 TTCCCAACAA TAGATCATGC AATTCTGTGA GATAAACAGC TGACTGACAG 
30401 AAAGACTCAT TTTGCAGAAC AGTACTTAGA AATTCATCTA AGGTCCTACC 
30451 AAACTAATTA ATTTGGATGA GCAGTCCCTA CCGTTTATCT ACTAAACTGG 
30501 GCTTTCCTGG AGTGCCAAAA CGGAAGGTGG CCATGTTAGT CATGAACAGC 
30551 TCAGTTTCTG TTACAGAGAC CCAAAATTAC AGAGGTATAA CATGCTAGAA 
30601 ACTTAACTTT CTTTCGCATC ACAGTCCTGA CCT AAGCAGG CAGAGCATGT 
30651 ATGGTGGCCC CATGCTATCT TGGCCCAGGC TGCTTCTGTC ACGTGGCTCC 
30701 TCCATCCCCA ATTGTATGTT TCAAGATGGC TGCCACTTCC TGCTCATCAC 
30751 AGCCCAGAGG AGGGAGAAAA GAGAAGCAGA ACCCTTAACC CCTCCACTAA 
30801 GGCATAATCT GGAAGTTCAC ACATCACCTC TGTTCATATC ATATAGGCAA 
30851 GAACTTAGTC ACCTGACCAC ACCCAGCTGC CAAGAAGGCC ACATCTAGCT 
30901 GCAAAGCAGG CCAAAATTTG AGAAATTCAC TTGATGAAGT GATAGACAAG 
30951 AGTCAA GATA GTGATTAGTT CTACTAAAAG CACCTAAAGT TTGTGTGTTA 
31001 I I I I I I CTAA TGGTGTTTAC CCTGGTCCAG TGCATCATGG TGCAAGCCAA 
31051 GGTCCAGAAC GATGGGTTTT ATGCTTTTCC CTTTTGGACA GGTCTTGCCC 
31101 GGGCCAAGTC CATTCCCAGC CAGACATACT CTTCAGAAGT CGTGACCCTC 
31151 TGGTACCGGC CCCCTGATGC TTTGCTGGGA GCCACTGAAT ATTCCTCTGA 
31201 GCTGGACATA TGGTAAGAGT GGTGCCGAGA AAATGTGAGT CATCCTACTC 
31251 ACGAGGGTTG CTTTATCATC TACATTATAT TTTAATAATA ATTCTAAAAA 
31301 TGGCAATCAC GTATATATTT TTATATATAT TTATATTTAT ATATTTTATA 
31351 TATATTTATA TAGTTATATA TTTATATTTT ATATATTTAT ATATTTATAT 
31401 ATAT TTGT AT ATATTTATAT ATTTATATAT TTTTATATAT TTATTATATT 
31451 TATA I I I I I A TA I I I I I ATA TATTTATATA TATTTTATAT ATATTTATAT 
31501 ATATATTATA TATATTTATA TTTATATATA TTTATATATT TATATATATT 
31551 TATATATTTA TATATATTAT ATATTTTATA TATTTATATA TTATATATAT 
31601 TTTATATATT TATATATTTA TATATTATAT ATA I I I I I I I ATATATATAT 
31651 ATATGTATTT I I I I I I I I IG AGATGGAGTC TCACTCTATT GCCCAGGCTG 
31701 GAGTGCAGTG GCACGATCTC AGCTCACTGC AACCTCCACC TCCCAGATTC 
31751 AAGCAATTCT CCTGCCTCAG CCTTCTGAGT AGCTCTACTA AAAAAATACT 
31801 AATATTTGTA GAAGATTCTT GCAATTATTC TATAACCTTT TACTGTTGAA 
31851 CTGA GACCCA CAGA GTTCCT GCCCAAGGCA TCTTCTGAAT CTGACACTCT 
31901 TTTTATGTTA TTTTA I I I I I TGAGATTGGG GTCTTGCTAT ATTGTCCAGG 
31951 CTGGTCTTGA GCTCCCAGGC TGAAGCAGTT CTCCCACTTC AGCCTCTTGA 
32001 GTAGCTGGGA CTATAGGGCT GCACCACTGC ACCCTGGCAA TCTCATGCTC 
32051 TTTCTTTCAC GCCTTTCCTC CTAGCTCCTC TCTTTAATCC TTTGCCTTGT 
32101 CTTCTCCTTG ACACCTTATC CACAGAGAAA CAAACATATA TCCCCAAACC 
32151 ACAGACACAC AGATGTGTGT GCACGTGCAT GTGCATGCAC ACACATCTGC 
32201 ATGAACATAC TCACACATGT CCAAACGTAG TTCAGAGCCT GGTTTAGGAA 
32251 AAAAAAAAAA AAGCATAAAG ACCAAGCTTC AAGACACCTG ATTTTCATGC 
32301 CAGTTCGATT TCTAATCAAT TAACTCTGGA TTCTGTTATC TTGAAAAAGT 
32351 CATGTATCCT CTCTGTGTCT ATGTTTCTCC A I I I I I AAAA ATGAAGGTAA 
32401 TAAACTCTCT CCATCTGAGT TAAATGGAAT TGTAGTACAA ATATAAGAAC 
32451 CAAATAGGTG GCTGGGCTTG CCGTCTCATG CCTGTAATCA CAGCGCTTTG 
32501 GGAGACCAAG GCTGGAGGAT CGATTGCTTC AGCCCAGTTG TTTAAGATCA 
32551 GCCTGGGTAG CACAGTGAGA TGCTGTCTCT ACA I I I I I I A AAAAAATTAG 
32601 TCAGGCGTGA TGGCTAATTA AACACTTCAG GAGGCTGAAG TAGGAGGATC 
32651 TCCTGAGCCT GAGAAATTGA GGCTGCAGTG AGTTTTGATG GTACCCCTGC 
32701 AATCCAGCCT GGGTTACAGA GCGAGACCCC GTCTGAAAGA AAGAAAGAAA 
32751 CAGAGAGAGA GAG AG AG AG A GAGAGAGAAA GAAAGGAAAA GAGAAGGAGA 
32801 GGGGAGAGGG GGAGAAAGGG AGAGGGGGAG AGAGGGGGAG AAGGGGAGAG 
32851 GGGGGAGAGG TGGGGAGGGA GGGAGGGAGG GAGGAAGGGA AGGAAGGAAG 
32901 GAAAGGAAGG AAGGAAGGAA GGAAGGAAGG AAGGAAGGAA GGAAGGAAGG 
32951 AAGGAAGGAA GGAAGGAAAG AAGGAAAGAA TCCAGATAGG TGCTATCAAG 
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33001 TAAAGCCACA GAGTTGGGGA GGCTCTAAGG TTAATGGGTT ACAATAGTGA 
33051 GCATGGGCTG TCAGACATGC ATCATCCTAG AACGGCAGTG TTATTTTCTC 
33101 TGGATCATGT TCCTGGAGAC TTCCCAGTCA TTTGGGGGCC ACTGTTAGAT 
33151 ATGTGATGAC TTTACAGACG TAGACAACTC CCCAAAGGTA AGGAAATATA 
33201 TGAA TCTCT T TCAGTACCTT GGAAGAAAGG GTTTATATAA AAACACAAAG 
33251 CCCCATTTTC AAAAATCCAT AATTGATTTT AAAAAATTAA ATGGTGTCCT 
33301 AAAAG GCTAA ACTAAGCTTT TAGATCTCCC AAAGAATTAA GAAAGGTTGC 
33351 AGACAI I I I I CTCCAGTGTA GAGTCATTGA TTTCTGATAC CCAGTACAAT 
33401 TTATAGAAAT ATCATCTGCT AGTCAAAACC CTCCTGAAAC TGTCAGCTCA 
33451 CACCGCTCAG CACTGTCACT TCAAAGGACT CCGGCAGGCT CTGGCTTACT 
33501 CAGCTCTTAA TGATGTCTTC CTGATTATGT TTCACAGAGT GAAACTTCTA 
33551 CCCGTCAATT TTAAACTAAT TTTATTATGG AATAGTTAAA ACATTCAAGA 
33601 GTATATATAA CATATATGTA GATCAGTGAT TCTCAACCAG GGAGCAATTT 
33651 TGCTCTGCAG GGGACATTTG GCAATGTCTG GAAACATTTT TTGTTTTCAC 
33701 AGCTGGGGGT GGGGTGGTGG GGGGTATCAC TGGCATCTAG TGGGTAGAGA 
33751 CCAG GGATA C TGCTAAACAT CCTACAGTGC AGAGGACAGC CCCTGCAACA 
33801 AAGA I I I I I C CAACCCAAAA CATCTGTAGT ATCAAGATTA AGAAAGCCGA 
33851 TGTAGGTTAA GAAGCTTAAT TTACTTTTAG AGACAGGGTC TCCCTTGGTT 
33901 GCCCAGGCTG GAGTACAGAG GTGAGATTGT CTCACTGCAG CCTCCAACTC 
33951 CTGGGTTTAA GTGATCCTCC TGCCTCAGCC TCCTGAGTAG CTGGGAATAC 
34001 AGGTGTGTGC CACCACACCT GGCTAATTAA AAAAAAAAAA GTGTAGAGAC 
34051 AGAGTCTCAC TTTGTTGCCG ATGCTGGTCT CAAACTCGTG GCTTCAAGAG 
34101 ATCCTCCTGC CTTGGCCTTC CCAACTGCTG GGATTACAGG TATAAGCCAC 
34151 CGTGCCCAAC CAATTAAGAA GCTTAATAAC GTGAACTTCA TAACCTGCTA 
34201 CCCAGTGTAA CAACTAGAAC ATAATCCGTA CTGTCCTATC AACTGTGTCC 
34251 CTTTCCCATC AACCTGCCCC TCCACTAGAA GGCCTTCTAC CAAAA I I I I I 
34301 TTTCCI I I I I TCATCAGTAT TCTCATATCT TTTTAAAAAT AATCCTTTTA 
34351 CATTT TAGAG GTATTCTTAA AAATA I I I I I TTGTTTTACT TGATTTTAAG 
34401 GGTTG I I I I I TTTTGAGACG GAGTCTCGCT CGTCGCCCAG GCTGGAGTGC 
34451 AGTGGTGCGA TCTCAGCTCA CTGCAAGCTC CGCCTCCCAG GTTCACGCCA 
34501 TTCTCCTGCC TCAGCCATGA TGTTATATTG CTTCTAGTCT TCTGTGACTT 
34551 GGCTTTGTTT CATTCAATAT GTTACATGTT TCTAAGATTC ATCCATGTTG 
34601 ATCTGTTTAG CTATACTTTA TTTTCTGTTA GTGAATATTT CA I I I I I I I I 
34651 AATGTCTATA GCTTTGCAAT AATACTTGAT ACCTTGTAGG CCAAGTCTCC 
34701 CAGCCTATTC ATCTTCTTCA TGAGGATACA TCAGATAAAC CTAGTTTAAG 
34751 GGACATTCTA CAGAGTAACT GACCTGTACT TATTGGAAGT GTCAAGATTT 
34801 TAAAAGATAA AGACTGAGGA ACTGTTCCAG ATTAAAGGAG ACTCCAGAAA 
34851 CCTGCCAACT AAATGTAACG CATGGTCCTA GATTGGATCT TGGGGGAGAT 
34901 GGTGCTCTAA AGAATACTGT AGGGACTATA GGTGAAATTT CAGTAGGGAC 
34951 TGTGGATTAG ATAGGGGTAT TGGATGAATG TTAAATTTCC TGATTTTGAT 
35001 AATTGCACTG TTGTTATGTA AGAGGATACT TTGGTTCTCA GAAAATACCA 
35051 ACATAATTAT TTAGGGATGA AGAGTCATGA TATCTACAAT TTACTCCCTA 
35101 ATGTTTCAGA AAAGATATAG ACAGACAGAC AGACAGACAG ACAGACAGAT 
35151 AGATAGATAA AATAACGAAA CAAAAGTGAC AAAATATTGG CGATGGATGA 
35201 ACCTGTTTGG AGGATATAAG AGAGTTCTTT ATACTGCTGC AACTTTTCTA 
35251 TAAGTTTGAA ATTATTTCAA GATTAAAAGT TGCCTCCAAA TTGCGAAATC 
35301 CTTGCTGTTT CATCAAAGTT AGTGTAAGAC AGCACTAGCC TAATATGTGA 
35351 TCAGTGTTTG TAATTTCTTC ATGTGTGTTT GAGAAGAATG TGTGTGTCCA 
35401 CCC AAATG TT GAGTGCTGCT GGGG I I I I I I TTTTG I I I I I G I I I I I GTTT 
35451 TTG I I I I I I I TGAGACAGAG TCTCACTCTG TCTCCATGCC TGGAATGCAG 
35501 TGACTCAACC TCGGCTCACT GCAACCTCCA CCTCCTGGGT TCAAGCGATT 
35551 CTCCTGCCTT AACCTCCCAA GTAGCTGGGA TTACAGGAGC ACACCATCAC 
35601 ACCCGGCTAA I I I I I GTAGT TTTAGTAGAG ACGGAGTTTC GCCATGTTGG 
35651 CCAGGCTGGT TTCGAACTTT AGATGTCAGG TGATCAGCCT CCCAAAGTGT 
35701 TGGGATTACA GGCATGAGCC ACCGCGCCTG GCCAAGTACC CATTTTTACA 
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35751 TATGTTCAAA AATTCAAGGT 
35801 ATT A I I I I IG TC I I I I I AAC 
35851 ACTATATTGT TGATTTGTCA 
35901 TGAAGCTATT TTCACTAAGG 
35951 TTTGTACACT TTAGTTACCA 
36001 TACATC I I I I AGAGTGTTTT 
36051 TGGCAAATTC TTCGI I I I IG 
36101 GGTGGTTTTG TTGGGGATGT 
36151 AACTTTGAAG ATGTCATTCC 
36201 GAGTTTGCTT TTAGCCTTAT 
36251 TCTGGATGTT TTAAAGACTT 
36301 CTCTAGGAGT TGTCCAGTGT 
36351 ATCTTGTGTT TCTTCCATTT 
36401 I I I I I GAATA TTGACTCTAT 
36451 AGACTATTAG ACCACATTCT 
36501 ATTTTCCATT TCCTTAACTT 
36551 CTCATCTACC AGTTCTTTAA 
36601 CAAAAGTAAT TGCAG I I I I I 
36651 GCTTTTGCAT CAACCTATAT 
36701 GTATGTTATT CTATGAATAA 
36751 TGACACAATA TAAAAATGCA 
36801 TATTCATTCC ATTTGATTGT 
36851 TAGAAATAGA ACAATATGCA 
36901 CCTCA AAGTA ATTGCTAGAC 
36951 CCCAI I I I I I TCI I I I ICTT 
37001 GAGAAGATTT CCAGTTGACA 
37051 TGCTAGGCCA TGCAACCCCA 
37101 GTAGATGACA CTGGGTGAGC 
37151 ATAACCCGTA GTGCAGAGCC 
37201 GTGTCAGTAG GGGGCCAGAG 
37251 AAAACTTGCA ACTAGGACAT 
37301 TTCTCTCATC TTCTGACGTC 
37351 AGACCTTAAA AGATTCCATT 
37401 TAGGATTATT AAATGGTGTG 
37451 TACATAAAAT TTACCATTCA 
37501 ATTCAGTACA TTCACATTGT 
37551 AACTTTTCAT CATCCCAAAC 
37601 CCCATTACCC CCTCTCCTCA 
37651 CTTGATGAAT TTGACTATTC 
37701 AATACCTCTT CTGTGTCTAG 
37751 TTCATCCATG TCGTAGTACA 
37801 CACAAGGTCA TCTATTCAGT 
37851 CTAGAGAAAG TTGTCTCTGC 
37901 CTGCAAACTT CCAAATATAA 
37951 ATGTCATGTC ACTGCCATAT 
38001 AAGAAATGAA ATACTGATTC 
38051 TTGCACCTCC AAAAAGCCTA 
38101 GGACTGCCTA AGGAATCTGA 
38151 GTTCATGCTC AGAATAGAGA 
38201 AAACATCTAA GCAGTTTCCT 
38251 AATAAAATGA CCCAGCACAT 
38301 TGTCACAGGG TCTTTTCTCC 
38351 GGGCTGCCCC ACGGGTATCT 
38401 AAGATATGTA GGGTAAGAGG 
38451 CAGAAAAATA AAAGGAAATA 



TGCTAATTAT ATTATCCAAA TCTTCTTTAT 
CTACCAATGA AAGGTGTGTT GAACTCATTC 
GAATTCT ATC CACTTTTGCT TTATATGCTT 
GCAAATAAAT TTAAGACTGC TCATTATTCC 
CTTTCAGAAT AATTTTCATT TCTCCTGAAA 
GTTTTGTTTG TGTGTGTGTA GGCCTGCTGG 
TTTTCAGAAG ATAAACCCTA ATTATTGAAA 
GATTCTAGAC TGACAGTTAT TTTCTCTCAG 
CCTTCTTTGT CTTCCATTGT TGCTGTCGAG 
TATCTTCCTT TTGCAGGTGA TCTCATTTTC 
I I I I CI I IGC CTTTATGATT ATGCAGTTTT 
GGAI I I CI I I TTACTTACCC TGTTTGGTAT 
GTGAATTCAT GTCTTTCATC AGCCATTTTC 
TCTATTCTCT CTCTGTAGAG CTCCAATGAA 
TCTGTTATCC ATTTCTCTTC TCTCCTTCAT 
TCTGTGATGC ATTCTGGGTA ATTTCTTCAG 
GTCTCTCTTA AACTATGTAT TAGGTTGGTG 
GCCATTAAAA GTAATGGCAA AACCATAGTT 
CTCTTACCTT TTTACCACAT ATACAAAAAT 
GTGTTTCATG AATTTAACCA TGAGCAACAA 
GTTATAAGTC AAAATTATTG TTATTACTCT 
TGTTTTCCTG GTAAAACTAA AAATGTAATG 
TCTTCCATTG AGCTCACTAT ATTTGTTTAC 
CTTGGGTATT TACACTGAGA TCCCTCTCCT 
TTCAGAGTGA TAAGAGGGGA AGTGAGAAGG 
AAGAATGAAA AAGAAAGAAT AATCCTATTC 
TAGGGTCCAA AGTGAATGCC CTTGTAGGAG 
ATTAGTGCAT TTGTCTTAAA GAAACCAATT 
TCTCCTTCAC AATGAGGCCT GGTGGCAGCA 
CAAATAAACA GGGGCTCTAG TTAATTATGG 
ATTG GTTATT CCCAAAGCTC CCAACCAACA 
I I I I C I I CTC TCTCTTTCTG CTACCTTTTC 
AGTGACTTTA GTGAGAAAAA TGCAATATTT 
G I I I I I AGTT TTTTGTATTG TGTTAAAATA 
TCACGATTTT CAGGTGTACA ATTCAGTGGC 
TGTGTAACCG TCACCACTGT CCATCTCCAG 
TCAAACTCTG CACGTATTAA ATGATAATTT 
GTCCCTGGTA ACCACGATTC TGC I I I I I AT 
TTGGTACCTC ATATAAAAGT GGAATCCTAC 
CTTGTTTTGC TTGGCATAAC ATTTTCAAGG 
CTGAGTTTTC CAGAAGCATT TATTTCAGTA 
ATCAGTTTCA GGCAGCTGCT GGTGTTAGGA 
CTAACAGATC ATTTACTGTC ACATTTCTCG 
AAAGGGTGGT CTAGAGAAAA GCAAGTGAGA 
ATTACGTTAT TCTGAATTAA CTTCAACAGT 
ATTTCTCCCA ACAACATTTT GATATTCTCC 
AAACTCCCGA GATGGATTTT TTTTCTCCAG 
GGAATCTTTC CCCCTCTTAT GGAAGAATTT 
AAAAGTAGGA GGAGAACCAG AAAGAGGAGA 
CTAACTTGAC TGAAGAACCA CATTTGGAAC 
CTCTCCCTTC TGGAAGGGTT TAATGTTTGA 
CCTGCATATG AATTTCCCCT TCGTCTACAC 
CCACACAGCA GAAATCCTCA GAGAAGCTTA 
AGCCCCAGGA ATGAAGATTT AAGGACAAAA 
GAAGCTGGTT CCCTATCTGG ACTTGAATGT 
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38501 TCAGAATATT TAAAATGTTT 
38551 AGATGATAGC CACATGACTT 
38601 AAAAGAAACA GTTCTGAACA 
38651 TCI I I I I I I I TATTAGATGG 
38701 AGTGGTGCGA TGTCGGCTCA 
38751 TTCTCCTGCC TCAACCTCCT 
38801 ATACCCGGCT AA I I I I I GGA 
38851 GGCCAGGCTG GTCTCGAACT 
38901 AAAGTGCTAG GATTACAATG 
38951 AAACTTACAG TGTGGGCTTG 
39001 ACCTATGGAT AGTCCATGTA 
39051 TTATAAGTGT TCTAAAGACT 
39101 ACATAGTGTG TGTGCATGTG 
39151 GATTTGTTAG ATGAGGAAGG 
39201 ATTTCAACCA TATAAACCAA 
39251 ATCTTGTGTT ACTTTATGGT 
39301 TTGCCAATAT ATTTTACATA 
39351 ATAAAGGACA CTAAGCAGGA 
39401 TATGAAAACA CACCATGAAA 
39451 GCI I I I IGGG GGAGAAAAGA 
39.501 TCA I I I I I I G AAACTTGTTT 
39551 GAGCTCCTGG GGTTTCTGGA 
39601 TCATGCCTTT AAAAGCCAAT 
39651 ATAACCCAAT TATGCATGTA 
39701 ACTCCAAAAC AAAATGGTCT 
39751 GTCATGGGGG GCAGGGTGGA 
39801 TCTCAGTTGG CTTATCTCTT 
39851 TTAAGCATTT GACAGTTTTC 
39901 TCATTTACAC CCTTAGTATC 
39951 CATGGATATT TAAATACAAG 
40001 CTACAAATTT ACCTCAAGAT 
40051 TGATCAAAAT TCTTGTGTCA 
40101 TCAGTAGAAA AGAAAAAATA 
40151 TCCCAGTACT TTGGGAGGCC 
40201 TCAAGACCAG CCTGGGCAAC 
40251 TAGAAAAGAA AAAGAATAAA 
40301 AAAAATCAGC AATAAGGCCG 
40351 ACTTTGGAAG GCTGAGGCAG 
40401 CAGCCTGGCG AACATGGTGA 
40451 AGCCAGGCAT GCTGGTGCAT 
40501 GGCAGGAGAA TCACTTGAAC 
40551 TTGCACCACT GCACTCCAGC 
40601 AAAAAAGAAA AAAAAAATTG 
40651 GAAAAACCTG CTTGCAAAGC 
40701 CTAAGAACAA GCCAGTCAGT 
40751 AGGG I I I I I G TATA I I I I I A 
40801 TCTTTTAATA TGTATTTCTG 
40851 ACTTGATCGT CAAGAAATCT 
40901 TCTAAACAGT GTTTGTCAGG 
40951 AGCATTTTGG GAGGCCTAGG 
41001 GACCAGCCTG GACAACATGG 
41051 AAAAAAAGAC CATATGTCTG 
41101 TAAACACATA TGAATAATAT 
41151 ATAAA I I I I I AATAATATTA 
41201 TTCTTTATCA TGCCACACAA 



GCTTTAAGAA TAGTCTGTGG TGGGCAAAAT 
GTATTCCTAA GGGTAAGAAG CAAATTAAAA 
GAAATGAAAA AATAAGATAA ATTGCATAGT 
AGTCTGGCTC TGTCGCCCAG GCTGGAGTGC 
CTGCAACCTC CAACTCCCCG GTTCAAGTGA 
GAGTAGCTGG GATTACAGGA ACACACCACC 
I I I I IGGTAG AGACGGGGTT TCACCATGTT 
GACCTCATGA TCTGCCCGCC TCGGCCTCCC 
CTTACACCTA GAACAGATCT GTCACCTTTC 
TTTTGTTATC AATGCATTGA TATTTACAGT 
CTGAAATAAA ATTGATTTAG GAATTTTGTC 
TGCACAAGTG CACACATACA CACACTATAT 
CGTGTATATA AATGAGTAAC CTTAGACTTA 
TTTCAACCTT CCCCAAAATG CAAATGGAGA 
ATATTGGCAT TTTATCTCTG GAACACAAAC 
ACTTACGTAA TGGCCTGAAT GCTCTAGTTT 
ATTTTGTATA CAAGTTTAGT GGTATAGAAG 
TTAACAGCTT GGTTCCCTAC AGCTGTTAAG 
AGGCAACAAG CTTCTTCCAG GCAATGGAAG 
AAGTGAATTA CAGGTTTAAA CCTAGGAATG 
AAAATATTTT CAATCCTTCT AGTGGTTTGT 
AGGTGTTTGG GAACTGGATA GAGGGTTAGT 
ACATTTCCAT TTCTCTTTTA TAACCAAGTA 
TTTTATATAC ACAGACACGT ATTTA I I I I I 
GAGGCCTTTC AAGAAAGTGC ATGTGGCGAA 
GACCATTTGG TGGTGCCCAC TAACTAGGTT 
AGTGGACCAT TGCTAGCAAC CAGGGTGTTT 
CATCA l I I I I ATTTGCCTTC ATATATTGTT 
TCTTGTTTTA AAGACAGGAG ACAAAAAGAA 
TTAATGAGGA ACTTTAAAAT AATAATAATT 
ACTTTACCAA ATTCATAAGT TACATTTATC 
CATATCAAGA TGTTTCTTAT ACAGCAGAAA 
GGCCAAGCGT GTGGTGGCTC ACACCTGTAA 
AAGGCAGGAG GATTGCTTGA GGTTTGGAGT 
ACAGTGAGAT CCCATCTCTA TTAAAAAAAT 
ATGGGGCTGT TATATCCAAA TTGGL I I I I I 
GGTGTGGTGG CTCACACCTG TAATTCCAGC 
GCGGATCAAT TGAGGCCAAG AGTTTGAGAC 
AACCCTGTCT GTACTAAAAA TACAAAAATT 
GCCTGTAATC CCAGTTACTC AGGAGGCTGA 
CTGGGAGGTG GAGGTTGCAG TGAGCTGAGA 
CTGAGTGACA GAGTGAGACC CTGTCTCAAA 
GCAATAAAAA CAACCTGTTG CTTGCTGGAG 
TCAGTCTGAT ATCAI I I I I I AAACAAAACT 
TAAGCTAAAA CCAAATATTT GATTATGAAA 
CAGGATAAGA TACAAATAAA TTTCAGTCTT 
TTCCCAAACC AGACACAAAG CAA I I I I I AA 
GTTTTCTCCT ACACAATCAA TGAAAAGTAA 
CCAGGCACAG TGGCTCACAT CTGTAGTCCT 
CAGGTAGATT GCTTGAGCCC AGAATTTCAA 
CGAAACCCCA TCTGTATTAA AAAAAAAAAA 
CAGTCAGATG GAAAAAGTAA AAATATGTAA 
TAAGGACCAT ATTTTAAAAT AAACTTGATA 
TCTACGATAA AATGTTTTAC TTAAATTTCG 
AAATGGCAAA ATGATTAAGA GAGTTTGCAA 
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41251 AATTATGTGG TATAGTGAAA GAGGTTTGCG GTTAAAAAAA AAAAAGAGAG 
41301 AGAGAGAGAG AAGTATGGGG CCATGGGGAT AGTCTCTGTA ATCAGTCACC 
41351 TGAACCACTT TTAATACTCA AAAGACTTAT GAGAATAAAA ATCTGATTTT 
41401 TGCTAAGATT TATTAGCAAA ATAAATCTTA CTCCTTCCTG TCCCTCTCTA 
41451 ATTATCCTTC AGCTTGACCA TGTATGAAAG AAAATTTACA TTTCACTGTT 
41501 TAATCTATTT AAAGATGAAC ATTTCCCATT AAATCAGGAT GCACCTTATA 
41551 ATCAGTAGCA TCTAACAATA TAAGTCAGCC AGGCTGCAGT TGTGACTGTA 
41601 GTTAGAATTG CACATGTGTG AACATCAAAT GAGCCAGCAT CAAAACGTGC 
41651 AGAATGGCCA GGCACAGTGG CTCACACCTG TGATCCCAGC ACTTTGGGAA 
41701 GCTGAGGTGG GTGGATCACT TGAGGTCAGG AATTCAAGAC CAGCCTGGCC 
41751 AAGATGGTGA AATCACGTTT CTACTAAAAA TACAAAAATT AGCCAGGCAT 
41801 GGTGGCAGGT GCCTGTAATC CCAGCTACTT GGTAGGCTAA GTCAGGAGAA 
41851 TCGCTTGAAC CTGGGAGGCG GAGGTTGCAG TGAGCTGAGA TCGCACCACT 
41901 GCACTCCAGC CTGGGCGACA GACCAAGATT CCACCAAAAA AAAAAAAAAA 
41951 AT TGCAGAAT TGGTG TCAGC GACTTGGAAG AAAATTCTGC AAAGAAAAGT 
42001 CUMMIN TC I I I I I M I TTTAAACTCC TAGGAACCAA ATGGTTGTGG 
42051 AGAAGGAGTA AATCAGACAT GTTTAGCAAC ATTCTTTAAG CAGGAGTCAA 
42101 AAGTAAGCTA ACACTACATA ACTGCAAGGC CAGCTTAGGA GCCCAGGACC 
42151 AATGACTCTC TGTTGTTTTA TGGATTATTT TAAGAAATGC TGCATCATCA 
42201 AATTCTTAAT ATAGAGGATG ATACATGGGT AAGTGTAGAC ATCAAAGAGT 
42251 CT GAGT CAAA TGCTGAATGT GAAAAAGTTT TAGGAATACC GAAACCAATT 
42301 TATTTTGCTT AATGTTTCTC I M I I CGTGT ACAAGTATGC TATATGAGAA 
42351 AATAATCTCT ATTTAATTAA ATTTATAACA GCCCTTTCAA TAAGTATAAA 
42401 ATGAACATTC TGATCATGTC ATAGTTTAAC TTGCA I I M I TTGTCTTAAT 
42451 GGCAAAAAAC CAATGACGCT TCTTACAATG ATAGCATCTT AGACTCAATG 
42501 AAAAGTGGGG ATGAAATGAA ATTTGGGGAT ACAGTACTTT CCCCTCTTCT 
42551 CCTAAAACAG ATAATGAGCT TGAATGATCT ACAATGTTTG CTAACTCTAC 
42601 TGCTTTGCTA ACTGCTGCTC GTGGTGTTCC ATTTTAATAA AAAGCTGTGG 
42651 GCTGTTCTTA TTTTGTTTGA CATAGGGACT I I I I I I I IGG CCCAAGACTT 
42701 TTAATATCAT GTGGTCCGTA TTTAACTCTC CCTAAAATAT TTCTTGGGAA 
42751 GAGAAATTCT AGTAGTTCAG TTTCGCTTGT ATGATTTCTT TCAAAGTGTC 
42801 AATTTACTCT TATTTCCTTT GCTAGGGGTG CAGGCTGCAT CTTTATTGAA 
42851 ATGTTCCAGG GTCAACCTTT GTTTCCTGGG GTTTCCAACA TCCTTGAACA 
42901 GCTGGAGAAA ATCTGGGAGG TAGGAGAATA ATTCTTCTAA AGAAAATGAA 
42951 ATATCTGCAT TTTAAGTTTT GAACCAAATT TGCCTTACAG ACAAATGAAG 
43001 CAGTCCATCT GCTCTGAGAT ATTAAGCCCT ATATTAAGAT TGTAGAAACT 
43051 GTAGCATTTG CCACAGCTAT AAGCACCCTG GGAATGTGTG GTCAGGAAAC 
43101 TCCCTGTTGC CCCATAGCAG CCCATGAATC CAGCTCACTG AATGATGTTC 
43151 AGGTCTCCTG CTCCCTGTCA TTAGTATTGT CTTAACCTCC CAGGGCAATT 
43201 TCTGCCATTA CTACTCAGAC ATGTCCCTAC CTTGCTACCT CCAGTTCTAA 
43251 TGCTACCATA TATTTGGCCC TGGATCTTTG TCAACTGAAA ATAAGACATA 
43301 GAA M I I I AG CTGGGTGCAG TGGCTCATGC CTGTAATCCC AGCACTTTGG 
43351 GATTGCTTTG AGCCCAGGAG TTCGAGACCA GCCTGGGCAA CATGGCGAAA 
43401 CCCCATCCCT ACAAAAACAA AAATGAGTGG GCTGTGTGGC GCACACCTTA 
43451 GTCCCAGCTA TTCAGGAGGC TGAGATGGGA GGATCACTTG AGCCCAGGGA 
43501 AGTCGAGGCT GCTGTTAGCT GTGACCACGC CACTGCACTC CAGGCTGGGG 
43551 AACAAAAAAA AGACACAAAA TTTTCATAGA ACCCTGATAG AACAGAGGCT 
43601 TTCCCTCTTA GTGTGAAAGA AGTGTACCAT TTATCATGCT TATCCACAGC 
43651 CAAATTCCTA AAGTGTCAAG GTGCCTTTGT GTGTGTATGC AGCTCCATTT 
43701 CTTAATTCAT TATTTATCCC TACCGCAGTT GCCTATGATA TGCTTTGTTT 
43751 T TATGGCC CT TATATAGTAT TACAGTCATA CTATAGTCAT CTGTATATTT 
43801 CCTTTTTTGG TCATA I I I M ATTGTGGTAA AATATACAAA ACAAAATTTA 
43851 CCGTCTTAAC CCTCCTTAAG TGTACAGCTT GTCAGCATTA AATACATTCA 
43901 TATAGTTGCA CCACCATCAC CGCCATCCAT TTCCAGAACT TCTCTATCAT 
43951 CCCTAAGGGA AGCTCTGGAC CCACTGAACA ATAACTGCCC ATCTTCCCTC 
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44001 CCCACACTCC CCTAGCCCCT AGTAACCTCT AATCTACTTT CTGTCTCCAT 
44051 GAATTGGCCT ATTCTAGGTA CCTCATATAA GTGGAATCAT ACAAATTTGT 
44101 CTTTCCGTAT CTGGCTTATG TCACTTAGCA TATTTTCAAG GTTCATCCAT 
44151 GTTGTAGAAT GTGTCAAGGG GCTTTAAATC GGCGGGGTGC AGGGGGGTAC 
44201 TTTATTACTT GCTATCCTGG ATCCTGCTGC TTGTCTTCTG GCTAAAATAA 
44251 AA TGTAC TTT GTGAAATTAA GACATTTTAT AGAGATTAAT TACTGACATT 
44301 AAATTTTCTT CTAGAAACAT GGGGGCTATT ATGAAGGAAC ATGGGAAAAA 
44351 CTGGGAAGCA TTCACAACTG AAAAAAAAAA ATCCAAGCCA AAAGACTTTT 
44401 TCTAAAAACT TTCTTGCAAG ACAGAGCAAT GCTATCTTCA CATTATGTTA 
44451 TTGGGTGCTA TAACATCATC TAAGCTGGAG ACAGCCTACT GTCATAGCTT 
44501 TGGAGTCCAA AGACCTGGGT TTGAATTCTA ACCATTTTCT AGCT AAATGA 
44551 ACATGGGCAA GTTATGTAGT CCCTCTGAAC TTTCGTTTCC TTGTCTGTAA 
44601 AATGGCAACA ATGATAATAA GGACTTTCTA ATTCTTTATT GAGAATTCCA 
44651 TAAAAACAAA TGCATAACAA GCTCCATGCA CCATAAATGC TCAATAGATG 
44701 CTTGCTTTCT TCCTGTCCCA TACAAATTGT TGTACAGATG TTTCAATAAC 
44751 CTAACTGCTA GCAAGTATTA CCTGAAATTT AACCCGATTG TTCTCTTCTT 
44801 TCACTTAGCA GTATTATTTC TTGTCCACAA TAGAGGAAGC ACAATTGCAG 
44851 TTCTGATGCT GCAATGACCT TTTATACATT TGAAGAGTTT TTCCTGGTCA 
44901 TTTAATCAGG AAACAACACT TACTCACCAT ATATGAGGCG AGTAACTCTA 
44951 CAAGACTCTA CAAGGTCTTG TAAGAAGCTA TAAGCCAAGG GGGAAAAAAA 
45001 AAAGAAGAAT AAGAAAAACA CATGATCTGT ATTTTCAAGT GTTGTTCAGT 
45051 CTAGGTAGGG CGATGGGTGA AGTATACGTA AATATATGTG AAACAAACAT 
45101 AAACTATGTA TATATGTAAA AGGATGTATG TATAGATAGT TAATATAAAT 
45151 TGTAATACTG AAATAAGATG TGCTATTAGG ATACTTGAAG AGTAGTTTAT 
45201 TTGAAAAGAA TATAAGTATA TCCTTGTGTG CCATTAGTAT TTGAAGAGTT 
45251 GTATATAAAC TGA I I I I I I I TCTTTTTCCT I I I I I I I GAG AAGGAGTCTT 
45301 GCTCTGTCAC CCAGGCTGGA GTGCAGTGGT GCCATCTCGG CTCACTGCAA 
45351 GCTCCACCTC CCCAGTTCAA GCGATTCTCC TGCCTCAGCC TCCTGACTAG 
45401 CTGGAATTAC AGGTGCCCGC CACCACACCT GGCTAACTTT TGTAI I I I I A 
45451 GTAGAGACGG GGTTTCACCA TGTTGGTCAG GCTGGTCTCA AACTCCTGAC 
45501 CTCGTGATCC ACCCGCTTTG GCCTCCCAAA GTGGTGGGAT TACAGGCGTG 
45551 AGCCACCGCG CCCAGCCTCA TAAACTGATT TTTAAAATAC AATATACAGT 
45601 TAGGCATAGT TGTGTGTGCC TATAGTCCCT ACTGCTTGGG AGGCTGAGGC 
45651 AGGAGGATCC TTTGATCCCA GGAGTTTGGG CAACATAGTG AGACCCCCAT 
45701 CTCTAATAAT AATAAATATA AATTTCAAAT AACATTTTAA AATATGACAT 
45751 ACTATCTTTG AATGACCACA CAATTTAAAA AGCAATCATT TTACGGTTCT 
45801 TTAGTGTTCA GTTAGCACAG CACTTAGAAA TCATAGAATA AAGTGAGCAA 
45851 GATGCTTCTC AAAGCCTGAT CACTCTTTAG GACTCACAAT GGGCTAGGTA 
45901 CTATGCTGGA AAGAGAAAAA ATAATAATTT TCTAACCTGC TTGAGACATA 
45951 GTGGTATAAA TGATAACACA GCTGCTGAAC GTGATGACTT TCTCACTTTG 
46001 TCCGCAGAGC AAGAAACTAT AGATGCAGTA ACAAAACTGC ATTCAATGAA 
46051 CATGGGACTG TAGATAACAA ACTAACTTCA TTTCTTTGGG TACATGCCCT 
46101 GTATTGGGAT TGCTGGATCA TATGGTAGTT CCA I I I I I AA TATTTTGAGG 
46151 AACCTCCATA CCATCTTCCA TAATGGCTGT GCTATTTGCA TGCCCACCAT 
46201 CA GTGTG CAA ATGCTCCCTT TCCTCCACAT TCTTGCCAAC ACCTCTTTCA 
46251 TCI I I I I GAT AATAGTTATG AGGCAATATC TCACCATGGT CCTAGACTTC 
46301 ATTTGTCTGA TGACTAATGA TATTGAGCAT I I I I I CATAT ATCTCTTGGC 
46351 CATTTGTAGG TCATC I I I I G AGAAATGTGT ATTGAGGTTC TTAGTCCATT 
46401 CCTGCTACCA TAACAAAATC CCTTAGAGTG GGCATTTTAT AAAGAACAGA 
46451 ATTGGCCCGG GGCGCAGTGG CTCATGCCTG TAATCCCAGC ACTTTGGGAG 
46501 GCCAAGGTGG GTGGATCACC TGAGGTCAGG AGTTCAAGAC CAGCCTGGTC 
46551 AATATGGTGA AACCCCATCT CTACTAAAAA TACAAAAACT AGCCGAACGT 
46601 GGTGGTGTGC ACCTGTAGTC CCAGCTACTT GGGAGGCTGA GACAGGAGAA 
46651 TTGCTTGAAC CCAGGAGGAG GAGGTTGCAG TGAGACGAGA TCGTGCCACT 
46701 GCACTCCAGC CTGAGCAACA GAGTGAGACT TCATCTCAAA AAAAAAAAAA 
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46751 AAAAAAAAAA AAAGAACAGA AATTTATTTC TCACTGTTCT AGAGGCTGGA 
46801 AAGTCCAAGA TCAAGGCACT GTAGGCTGTT GTCCAGTGAG TATATTTGGT 
46851 CTCCAAGTTA GTGCCTTGTC GCTGCATCCT CCAGATAGGG CAAATGCTGT 
46901 GTCCTTACAT GGTGGAAGGG TAGAAGAGCA AACGGGCCTG ACTGATTCCC 
46951 TCTAGCTCCT TTATAAGGGC ATTCATCTCT GTCCTTGTGT CCTAATCACA 
47001 CGCTAAAGGT GGCTAAAGGC CCCACCTCTT AATACTGTTG CATTGGGGAT 
47051 AAAGTTTCAA CATGAATTAT GAAGAGAATA CAAACATTTA AACCACAACA 
47101 AGTCCTTTGC CCAC I I I I I I TTTGGAGACC GAGTCTCACT CTGTTGCCCA 
47151 GGCTGGAATG CAGTGGCTTG ATCCTGGCTC ATTGCAACCT CCACCTCCTG 
47201 GGTTCAAGCA ATTCTCCTGC CTCAGCTTCC CAAGTAGCTG GGATTACAGG 
47251 TGTGCACTAC CACACCCAGC TAATTTTGTA TATTTAGTAG AGACAGGGTT 
47301 TTACCATGTT AGCCAGGCTG ATCTCGAACT CTCGACTTCT GGTGATCCAC 
47351 CTGCCTCAGC CTCCCAAAGT GCTGAGATTA CAGGCGTGAG CCACCGTGCC 
47401 CGGCCCTTTG CCCACTGTTT AATGGGGTTG TCTTCTTGCT ATTGAGTTCC 
47451 TTATATATTT TTTATATTAA CCCCTTATCA AATGTATGGC TTGCAAATAT 
47501 TTTCTCCCAT CGTAGGTTGT CTCTTCACTC TAATGATTGT TTCCTTTGCT 
47551 CTGAAGACAC I I I I IAGTTT TATTTATTCC CATTTGTCTA TTTTCACATT 
47601 TGTTGCCTAT AAGCAGGTTA GAAAATTATA CAGATTATAA ATAGTTCCTG 
47651 AATTTGTGTT TTACTAAACG TAGCCTACAC AGATGAAAAC AGGAAAGCTA 
47701 CACTTCAGAA TCTGTGATAT TTGATGTCAG AAGTGCATCC CTGAAAGCAA 
47751 TGGGTCCATT CTAAATCTCC TAACCTCTAA CCATAATTTG TTCTATATTT 
47801 ATCCTGAGAT CTCACTCTTA GGAATAAAAA CACATTGAGA AGTCCTGAGT 
47851 CTCTATTTTA CTAI I I I I CT GAAGTGCCTG TAGTGTGTGT GTTTACATCT 
47901 AAATAA TAGC TGTCACCACT TTCTGATCAA TTTTAAAAAC TAATTTTAAA 
47951 TAAGTGTTTT TCATAAATAA TCCTGGATTT AGTTCTAAAA TCAGAATAAA 
48001 CTATGCAAAC TTTGAATCCA TTAATCAAAA TGCTTTTAGT TTCCATTCCA 
48051 ACAAAGGCAG ATAAACAGCC CCTTCAGACC ACTGTGGTTT GAAACATAGC 
48101 ACTC AC TGGC TGCCTTTTAA GAGCCTTCAG GGAGGGAGCA AAACAACAAT 
48151 TTTTGGTTTT CAGTTTCCCA GACAGTGAAG GAGAGATTTA GTAATTTTCT 
48201 CAAGTGAAAA AGAATTCAAT AACTTGCAAA TAGAAACTGA GATCAAATTT 
48251 CCAAATAAAG TATATTGAAT TTTTGTTTAA ACTTTTAAAA TCTCAAGCTT 
48301 AAAGCTTTGA ACATAAGATT AAAAAAACTT I I I I I AGTAT CCATTTTGTT 
48351 GGCTTTAGTT AAATATCATA CAAAGTAACC AACCATCTGG TAACTTTCAC 
48401 CTTAGAGAAA ACATGATAGT GGTTGTCACC TATTTCTTCT ATTGTTTTCT 
48451 CTTCATTATC TTTGCTTTCT TTTCACTGCA CTTTGCCAGC CAACAGAGGA 
48501 TGTATGGGTA CATGTGACTC ACACCCACTT GTTTACACAT GCATCTGTGC 
48551 AAATACATAA GATGGTAGGT TAAAAAAAGA AGAATTAGTT TCTTGTCCCC 
48601 TGGCCTTCTC CCACAAAAGA AGAATTAGTC CAGTTGGTTT TTCAAAATGG 
48651 ATTCCAGGAT TCTTAGTGTT CCCTCGGGCT CAGGGTGGTT GATAGGAAAA 
48701 GCCTATAATC CTCTCAGTCA CTTTTCAGTT TGTTTAGGGA ATGGATCAAA 
48751 GAAGGAAGAT TTTACTGGGT GGCATGATTT TTTTATTATA TGAGGGAAAA 
48801 TAGCACTTCA CTGTCTTTTG TTTAAAGACA AGCTTAACAG ATGCTAAAAA 
48851 GTACATCTCT CAGCCAGATT CCTAGTCAAC AAGCTGATAG ACACTAAGAT 
48901 TCTGGATTCT TCATTGATTA TATTCAGTCA TTGTTGGGCA ATTGACTCCC 
48951 TGCCATAATA ATTGGGCCAG TATCTATAAC CAGCATTTTA CAGATGGATT 
49001 CGCTAGACTC TTTCTGTAAG AGATGTTTCT AAAAAGAGTT ATAGTGAGAT 
49051 ATGC TTCTAA GAAAAGTTAT ACTGTAGTAG TGTAATGAAA GCTACTAGTG 
49101 TTTTATTAGT A TTTC ACAAG AACAATGTTA CTCTGTCTCC CATATATAAC 
49151 TGTCTATGGG CTTTTATGAT TATTCTTTAA AAAAAAAAAA TACTAAGGTA 
49201 ATGCCTACCG GGGAACTCAT GGTGCTGGCT TCATCCAAAG TCTGAGCTGT 
49251 TTTGGCTTTA TACTCCGAAA GACTTTATTT TCATACATCT TAACTAAAAA 
49301 CTGGGGCTTT AAATTGGTCA TTCAAGGCCA GGCGCGGTTG CTCATGCCTG 
49351 AAATCCCAGC ACTTTGGGAG GCCGAGGTGG GCAGATCACG AGGTCAGGAG 
49401 ATTGAGACCA TCCTGGCCAA CACGGTGAAA CCCCGTCTCT ACTAAAAATA 
49451 CAACAACAAC AACAACAAAA ATAGCCAGGC GTGGTGGCTT GCATCTGTAA 
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49501 TCCCAGCTAC TCAGGAGGCT GAGGCAGGAG AATGGTGTGA ACCTGGGAGG 
49551 CAGAGCTTGC AGTGAGCCGA GATCGCATCA CTGCACTCCA GCCTGGGCGA 
49601 CAGAGCGAGA CTCCGTCTCA AAAAAAAAAA ACATCGGTAA TTCAAAGCAT 
49651 AGACCAGCCC I I I I I CAAGT GATGTTGTTC CCATGACAAT CCATCAGTGA 
49701 AAAACCAAAT ACCATATTCC AAGCTGCTAG TCACAGAGAA AACAAGCAGA 
49751 TGAGATGAAT GTAATAGAAA AGACTAGAGT TAGTTTTGGG GTCATCTTTA 
49801 GCCAACATTC CATTGCCTGA AGCTCAGTAA TCTGAATCCT TTTTAATTTG 
49851 AGCACATCAG GGAACAGCTG AATACCCATG CTGAGGCATA ATTTAAGCTG 
49901 TCAAGTGTCT CCTGTCAATA TACATGTGGT CATCTGATGC AAGGCAAAGA 
49951 GACAGTCACT CCTGCTTCTT TATATCCCTA GCTCCCAACA TGGTGTCCTA 
50001 ATGCATGATA ATCATGCAGT AAATGTTCAG TGATGAGAAC ATGACTTTGA 
50051 GCAAGGCTGT ATGATCTGCC TCAGAACAAG TGAGTCAGTA AGAATGCAGG 
50101 CCCCGGACCA TAGGAATGTA TTACAGTTTT GCCCAAGAAA CCACAAACGT 
50151 TGGAAACACT CAAGTTTCTT TCTCGTATAC ATCAGCTGGT GTCATGCAAT 
50201 GGGACATACC ATCTGACGCT TCCCTGTTCT TCCCTGATTT GTCCTGCATG 
50251 TCTCCAATAC CTCTTTCCAA CCACCTCATC TCCCCACCTC ACCTTTCTTT 
50301 TTCTTTGTTT GGCTTTATAT AGGTGCTGGG AGTCCCTACA GAGGATACTT 
50351 GGCCGGGAGT CTCCAAGCTA CCTAACTACA ATCCAGGTAA TATTGATCTG 
50401 AGCTTCTGAA TACTCTGAGA ATTAGTAATG TAAGGAGAGC ATTGGCCACG 
50451 CTAACAGGGC GTTCTTGTAT TGTGAACTCA GCGGCAAAGA TGGGTGTAGA 
50501 GGAATTTCTA CATTCATATA TTCCCTGACT AATCTTTGTA TGAGGAAGAC 
50551 ACTGAAAGAG TAGCTGAGGT TAGACCAGTT • CCCCAGCTCT GTAAAACACA 
50601 AGTAGCAAGC TGAATAGAAT TTGAAATGAC TATTACTGTG GATTCCACAT 
50651 CC ATTGT CAA ATACCCAATG GCTCAAAAGA ACAACTCAAA AGATGGGCTC 
50701 ACTTTTGGGC CCCCTGACTG TCATAAGTGT ATTGATTAGT ATTGAATTGC 
50751 ATATGTATAA AAAGAAAGCT AATGCAACAG AACAGAGGTA GAGGCTCGCT 
50801 AGGCCTAGGA CATGCCAAGT AAGCTGTCTG TAGGTTATAC TTACTAAGAG 
50851 TTCATTCATT GCCTGTAAAC CTGACACTTG GTCATTGTCT CTCACACATT 
50901 TCATCTTTCA AGACTGGCTT CTGGGATCGA TTTAGAAGTG CTGGAAGTGT 
50951 TATCCATGGG GGAATTCTTT GAGAAGCTGT CGCAGGGCCA CATCAGAGGG 
51001 ATCAGATTAA GCAGTAGTCA CTTCAAGGAT GTTGAGACAG AGGGGAGGAG 
51051 ACAGGCACTG AACTACAGGA TGAAGGATCA TATTAGAAGC TGAAGAAGCA 
51101 AATAAAGCCC ATGCCAAAGC TGAGCTCTCA CTGGCAGGGT TGAAGGGGAG 
51151 G TAGAAA GGT ACAGATAACG ACAAGATTAG GGTGGATATG CTCCAAGCCA 
51201 GA I I I I I CTA GTCTTTATGG TCTTACATTG TTCCATTACT AAAAATGAAA 
51251 TCTT CCCAAA TTGTTGTCCT TAC I I I I I I I I I I I I I I I I I TGAGATGGAG 
51301 TTTTGCTCTT ATCGCCCAGG CTGGAGTGCA GTGGCACGAT CTCGGCTCAC 
51351 TGCAACCTCC ACCTCCTGGG TTCAAGCAAT TCTCCTGCCT CAGCCTCCCC 
51401 AAGTAG CTGG GACTACAGGC ACCCGCCACC ATGCCCAGCT AA I I I I I I GT 
51451 A I I I I I AGTA GAGATGAGGT TTCACCATGT TGGCCAGGCT GGTCTCGAAC 
51501 TCCTGACCTC AGGTGATCCA CTTGCTTCAG CTTCCCAAAA TGCTGGGATT 
51551 ACAGGCATGA GCCAGCGCGC CTGGCCTGTT GTCCTTACTA ACTTTGGTAT 
51601 GAGATTATCC TGGAAGGGTT TCCTGAGAGC AAGAAATTGT AGGTAGAGTT 
51651 AAAATGTGAT TAAAGAAGAG AATAAAATAC ATAGGGAGCT GGGGACTCTT 
51701 TTTCTTATTT TCTTTAGCAT CCAATACTTT TGCTTACAGC TATCCATAGG 
51751 GATCTGGCAT CTTGAACCAC CAGGATTATG GAAGCCCTAC AGCAAGCTAA 
51801 AGACTAACTG TAAAGTCCTT TCAGCTGCTT TGTGAATGGT TATATCTATT 
51851 GCTAAAAGGC CTTAATATCA TTTGCAAATA GTTTATGATT TCTAACTATT 
51901 TTTCTAGAGT TTAACACGTG AGAAAAATGC TACTCTCTGG TCACAGGACT 
51951 TAGAATAGTG CCTATTTCCA TTGGTCTGAG ATAGAGAAAA AAGAACAAGT 
52001 TTCTTGTGGA GCCGTGGTCC AGTCTGCAAA TTGCTCCTAT CTCCAGTTGC 
52051 CATGGTTTCC AGGAGAACGT GGCTCTCATC TTTTCCTGCC CTGCCTGTAC 
52101 TTCTCCCTGT CCACTCTGTT CTCTATTTTC CCTCAGCTTC CTAACTGAGG 
52151 ATGCCAGCAG AAGTTTAGAG TCACAGATGG ATTGTAGGAA ACAATTTGGA 
52201 TGATGCCAAT ACAAAGCTAC TGTGGTGGGC ATATGCTGCT CCCCCAAACT 
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52251 TCAGACATTT GGGTTTCAGG 
52301 TACAAAATGT CTTGGTGAGA 
52351 TCCCTGCCAG ATTGTGTGCT 
52401 AGAAAATATG TTACATTTTA 
52451 CAG I I I I I CT ATATAAGAAG 
52501 TCATCATCCA TTG I I I I I CT 
52551 AAAAATTTTC AGGTACCTAC 
52601 GCTACACAGA TGGAGAGGGT 
52651 GATTCATTCA CCCACACTCT 
52701 CATTTGTGAG CCCTGCTTCC 
52751 GGCTGAGCTT ATGGAGCAAA 
52801 GTCTAAACCT GGATTTGGAA 
52851 GGGGTCAGTA TTATTGATAA 
52901 TCTCCAACAT CTGCATTTCA 
52951 CCTTTTGAGC CACCCCCTAC 
53001 TAAACACTTG AATGTCAGTG 
53051 CAGGAAAGTG TCACAGATGA 
53101 TCCCATTCCT TTCCTTTTCA 
53151 TGAATCAGCC AGTACAATTT 
53201 GAACCATCCA GTGTCCATCT 
53251 TGGACACCTC TAGGAGGACA 
53301 AGTCACCTGA TGCTGACCCA 



TTGGT.CCAGG CAATCAACAG TGATCCTTAA 
GCAATAATCA AGAAACTTGG CCAAAGTGCT 
TAATAAGATA ACTGGGTTCC AATAAAACAG 
AAAAATTTTC TGTTGTTTCA AAACAATGTG 
AAAAGTCTCC AGGCCCAACA TCCATAGGGC 
TTTAAGTTTT CAATTTAATC CAAATAAGTC 
TATCTGCCAG GTGCTGTGCC GTGCGCTGGG 
GCATTCTTGG ATCTCTAGTG TTTGGGTTTG 
TTCACCAGTT CTCTTTGTTA CTGGGGTGCT 
ATGGCTTGGA GAGTTTGTGG CTGTGGGCCA 
GGGAGTTGGA ACCTTAGCCA TAGACATGAT 
ATCTTAAAAG TCCAGCCTAT CTTGGGCCAT 
CTCAATCCCA AGGACTGTGT TTTAAAAGGG 
GGAACATCCT CTTACGTGAG TCAATAAGTT 
CCATCCCCAT CCCTGAGCTG CTGTGGCTTC 
ATTAAGGGGA GCAGAAGACA AGCTGGGAGC 
GCACCGTGTC AGCAGCATTC TGGATGAGCT 
TTCTAAGTAG TCCTAGGAGC CCCCAAACTT 
TGAGGGAGTC CAGTTGTCCG GAACTTGGGA 
ACACCCATGC CTCCATTTCT AGGCCTTATC 
GCAAAGTTTC CATTTGTACA GCTTTTAAAA 
GTCGGATTTC TC (SEQ ID NO: 3) 



FEATURES: 

Start: 2118 

Exon: 2118-2240 

intron: 2241-2946 

Exon: 2947-3096 

intron: 3097-3310 

Exon: 3311-3405 

intron: 3406-7938 

Exon: 7939-8018 

intron: 8019-18260 

Exon: 18261-18355 

intron: 18356-19116 

Exon: 19117-19179 

Intron: 19180-29296 

Exon: 29297-29420 

intron: 29421-31091 

Exon: 31092-31212 

intron: 31213-42825 

Exon: 42826-42919 

Intron: 42920-50322 

Exon: 50323-50406 

Stop: 50407 



CHROMOSOME MAP POSITION: 

Chromosome 2 



ALLELIC VARIANTS (SNPs) : 



FIGURE 3T 



Docket No.: CL001098D1V II 
Serial No.: TO BE ASSIGNED 
Inventors: YAN, Chunhua et al. 
Title: ISOLATED HUMAN KINASE . 



DNA 



rOS 1 LI on 


Ma] Or 


mi nor 


uomai n 


O04 


c 

V- 


T 

T 


Beyond 






G 


Beyona 




c 


T 


Intron 


30/ 3 


A 

A 


G 


Intron 


3 /4/ 


a 

A 


G 


Intron 


3/00 


T 


G 


Intron 


8U34 


T 


A 

A 


Intron 


Z //4U 


G 


c 


Intron 


777 r -5 


C 


T 


Intron 




T 


C 


Intron 


3U/ / £ 


A 


G 


Intron 


303-LU 


T 


G 


Intron 


303£/ 


T 


C 


Intron 


*tUOJ-0 




L 


in rron 




1 




intron 


41044 


A 




Intron 


41311 


— 


G A 


Intron 


41 31 3 




A lj 


inrron 


44701 


C 


A 

A 


inxron 


46020 


T 


c 


intron 


46036 


A 


G 


intron 


46095 


T 


A 


Intron 


47608 


T 


G 


intron 


51949 




C 


Beyond 


52150 


G 


A 


Beyond 


52426 




T 


Beyond 


Additiona" 


1 SNPS 3' 


of the 


0RF (DNA pi 


sequence provided 


in U.S. 


Serial No. 


NO.CL001098-PROV, 


filed January 31, ; 


56707 




T 




57444 


A 


G 




58021 


A 


G 




58064 


A 


C 




59067 


T 


C G 




60034 


A 


G 




63709 


A 


T 




63817 


G 


A 




64845 


A 






64848 


G 






64914 


G 


T 




67367 


G 


A 




67497 


T 


A 




68252 


G 






68580 


C 


G 




69990 


G 


A 




71472 


A 


G 




71664 


T 


A 




71677 


A 


G 




72590 


T 


C 




72757 


G 


C 




72863 


A 


C 




74565 


C 


T 





0RF(5*) 
0RF(5') 



0RF(3') 
0RF(3') 
0RF(3') 

ositions refer to the genomic 

60/265,151, Attorney Docket 
2001) : 



FIGURE 3U 



Docket No.: CL001098DIV II 
Serial No.: TO BE ASSIGNED 
Inventors: YAN, Chunhua et al. 
Title: ISOLATED HUMAN KINASE ... 



74850 A G 

75462 G C 

76045 A G 

79973 G A 

83181 - T 

84935 C T 

87476 C G 



Context: 

DNA 

Position 

864 ATATTCTGAGAAACTAGCTTCTCACTCTCTCAGTTGTCAGTCAAAACTTTAATGGTCTTT 

GGCCGGGTGCGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGTGGA 
TCACAAGGTTAGGAGATCGAGACCATCCTGGCTAACACGGTGAAACCTCGTCTCTACTAA 
AAATACAAAAAATTAGCCGGGTGCGGTGCCAGACGCCTGTAGTCCCAGCTGCTCAGGAGG 
CTGAGGCAGGAGAATGGTGTGAACCCGGGAGGCGGAGCTTGCAGTGAGCCGAGATTGCGC 
[C.T] 

ACTGCACTCCAGCCTGGGCGACAGTGCGAGACTCTGTCTCAAAAAAAAAAAAAAAAAAAA 
GTTGAATGGTCTTTGAGCCAAGTAGTCTTCCTTTTCTTCTTCTTC I I M II I I I I I I I I I 
CAAAAAATATCTCTAGATTGAATCTTGGAATTGGCTTAAGTCTCTTCTCTTGTGGCAATT 
TTGAAATGAAAAAATACATGCTCATAATTAMTTACCTGAACATTTTAAAAAACCATCAT 
GAGGTTCAAATATCAAATATTCATAAATATTGTTGTGATAATAGACATAACTCTTATTTT 

2 111 TACCAATAGCACTAACTTGTGGCCAGAACAAGAACCTTAACTGTGCCAAATTTTATTCTA 
TTCAATAACAGCTGCCTCGTTTTCAGTTGTGCACATCTGAATGCAAGCAATCCCTGTCTG 
ATGTGGAGTTTCTTGCACTGATAAGGAAAAACTGCTGAAGTTGTGAGGCTGCTCCAGGCA 
GAGCCATCATGTGAGTCATATGAAAGCTCCACGCTGCTGACCTCTGGCAAAAAGGGAGAG 
AACAAGGATAGGAGAGGCAGTGGGGGAAAGGTTCAAGTGCGGGTTTTCTCCTTGAACCTA 
[C,G] 

AAGATTATGGGTCAAGAGCTGTGTGCAAAGACTGTACAGCCTGGATGCAGCTGCTACCAT 
TGTTCAGAGGGAGGCGAGGCACACAGCTGTCGGAGGAGTCAGCCTGAGACCACGGAGGCT 
GCGTTCAAGGTATTTGTATCCCAGGAGAGAGCATCTTTCTCTATTGATAAACCAAGGAGT 
TCAGACACTCCL I I II I GTAGCGGGATCTGATTCTTCTGCGGTAGGTCTAAACCAATAAA 
ATGAAAATTCTATTAAAGTCACAGAAAATTTATGGCTGTAGTTATCAAATTTGGGGAATT 

3259 AAAGAAGCATCATGTTCCATGACTTCATTTCACCCCAGGGGACTTCAAGCTGCCCGTGCC 
CAGAAGTTCAAGAGTAAAAGGCCACGGAGTAACAGTGATTGTTTTCAGGAAGAGGATCTG 
AGGCAGGGTTTTCAGTGGGTGAGTGAGCAGCTGATGTTGATCAAGAAGAATTTAATGTGA 
GCTTGTCTACGGAGGCCGGCCCTTGCTTCCAGGGCAATTACTGAGCGAGCCTTCCCAAGT 
CTGCTCTGGCAATGCTGTCTAATTTCCCTGGGGAAAAAAAGTCAACACTAAAAAAAAGTG 
[C.T] 

TCTTTCTCTCTTCCCTTTCACCCGCTCCTTTTCCCCATTCCCCTAGAGCAGAGGAAGAGC 
CTCCCTTTTGGGGCAGCCTCATCTTACTTGAACTTGGAGAAGCTGGGTGAAGGCTCTTAT 
GCGACAGTTTACAAGGGGATTAGCAGGTGAGTGACACATAGCTGGGAGAGACTTTAGAGA 
TGAGAGTCCCGCCCCCCCAATTTCATATTATAAAGCCAGGTGAGACATCATAGAAGTTCA 
TAGCACTCAGGACCTGTGCAAGACACCATGGCCGACAGGGAGAGAGACATGATAACTTAA 

3673 CTCTTATGCGACAGTTTACAAGGGGATTAGCAGGTGAGTGACACATAGCTGGGAGAGACT 
TTAGAGATGAGAGTCCCGCCCCCCCAATTTCATATTATAAAGCCAGGTGAGACATCATAG 
AAGTTCATAGCACTCAGGACCTGTGCAAGACACCATGGCCGACAGGGAGAGAGACATGAT 
AACTTAAACAGCCTTGAAAGAAAAACAAACCTGCCCTGCCCTAATTAAAATCAGCCCACT 
TAAATGTTTATCAGCCTTTCCCTTCTTGCATTCAATTCAGAGAATTCAAAGAAAATAGAC 
[A,G] 

TTCTCTACTACTGACCCAAAGAACAATTATCACTCTTCAGGCCTGTGGGAGGCACAGTTG 
GTAAAGCGTCTCTAACAGG I I I I I I ATATCCCTCCCTAAATCACAATGACAGAGTTTTGT 
AATGGCAACCTGGAATTTGCTGCTTCATTCCTCCACCTGGCCTTTATAGAAGAAACTGAA 
GTTGGTTTCTG CAAATTATGGTACATG CAAAAG ATG ATAAATCCTAG ATTTTTTATATTT 
GCAAAATACACAAAATGTCTGGAGAATAAAAATACTGCTTATCCAAAAGCTAAGTACTAA 



FIGURE 3V 



Docket No.: CL001098DIV I! 
Serial No.: TO BE ASSIGNED 
Inventors: YAN, Chunhua et al. 
Title: ISOLATED HUMAN KINASE ... 



CCCGCCCCCCCAATTTCATATTATAAAGCCAGGTGAGACATCATAGAAGTTCATAGCACT 
CAGGACCTGTGCAAGACACCATGGCCGACAGGGAGAGAGACATGATAACTTAAACAGCCT 
TG AAAG AAAAA C AAAC CTG C C CTG C C CT AATT AAAATCAG C C CA CTT AAATGTTT ATCAG 
CCTTTCCCTTCTTGCATTCAATTCAGAGAATTCAAAGAAAATAGACATTCTCTACTACT 
ACCCAAAGAACAATTATCACTCTTCAGGCCTGTGGGAGGCACAGTTGGTAAAGCGTCTCT 
[A,G] 

ACAGGI I I I I I ATATCCCTCCCTAAATCACAATGACAGAGTTTTGTAATGGCAACCTGGA 
ATTTGCTGCTTCATTCCTCOVCCTGGCCTTTATAGAAGAAACTC 

ATTATG GTACATGCAAAAG ATG ATAAATCCTAG ATTTTTTATATTTG CAAAATACACAAA 
ATGTCTGGAGAATAAAAATACTGCTTATCCAAAAGCTAAGTACTAATTTTGGTAAACAAC 
CAACTTTGTTAAATATATGTAAAAGATCCATGAATTCCCC I I I I AGTCAAGGTGGGAAAG 

CATAGAAGTTCATAGCACTCAGGACCTGTGCAAGACACCATGGCCGACAGGGAGAGAGAC 
ATG ATAACTTAAACAGCCTTG AAAG AAAAA CAAACCTGCCCTGCCCTAATTAAAATCAGC 
CCA CTT AAATGTTT ATCAG CCTTTCCCTTCTTGCATTCAATTCAG AG AATTO 
TAGACATTCTCTACTACTGACCCAAAGAACAATTATCACTCTTCAGGCCTGTGGGAGGCA 
CAGTTGGTAAAGCGTCTCTAACAGG I I I I I I ATATCCCTCCCTAAATCACAATGACAGAG 
[T f G] 

TTTGTAATGGCAACCTGGAATTTGCTGCTTCATTCCTCCACCTGGCCTTTATAGAAGAAA 
CTG AAGTTGGTTTCTGCAAATTATGGTACATGCAAAAG ATG ATAAATCCTAG A I MIMA 
TATTT G CAAA ATACACAAAATGTCTGG AG AATAAAAATACTG CTTATCCAAAAG CTAAGT 
ACTAATTTTGGTAAACAACCAACTTTGTTAAATATATGTAAAAGATCCATGAATTCCCCT 
TTTAGTCAAGGTGGGAAAGTTGGATGGTCGC I I I I I I CI I I ATGTTACTCCAATAGAGAG 

CACTTT AGTAA AGAGGGAAAATGCTTTGGAATATATATGTTATATATGTATTGATACATG 
TTAAA CI I I I I ATTTTG AG AAAATTATAG ATTTATATG CTAG AATATATTTTG AAGTG AA 
AGTGLI I I I GTTAAGCCATCTTTGGTATAAATTGCTGCTTTGAACCACCTCAATAAGTGT 
GTGCCCCTCAATCCCTCTCTTCTAGAATAAATGGACAACTAGTGGCTTTAAAAGTCATCA 
GCATGAATGCAGAGGAAGGAGTCCCATTTACAGCTATCCGAGAAGGTAAGAACAGCAGAA 
CT,A] 

TGGACCCAATAG ATCT GTTTTGAGTCCTTGATTTGGTAAAAAATGTATTGCATTGATCCA 
T TC AG C ATCTAGTTTTG ATTCTTCTGG AATACTATAATTAC A I I I I I A M M I CATACAA 
G I II I I CAAG AAATTT A C ACTG CT ATTTT ATT A CTT AATTTTG AG G AAATTG AG ATTT AA 
AACTATTATATCACTTGACCAAAACTATAAATTCACTGAGCAATTACTAATACTTTCCAT 
GTGTTTGGCCTCATGCTAGGTGCTAAGGCTATACCTATATAACCTCAGAAAATTCCTATA 

C CTGTCTTCTATC CACCTGTCTTCCTCTTCCTCTTTCCCTAGTCCTGCATATTGAAAAAC 
A I M I II I II I II II I GAGATGGAGTCTTGCTCTGCCACCCAGGCTGGAGTGCAGAGGCA 
CGATCCTGGCTCACTGCAACCTCTGCCTTCCAGGTTCAAGCAATTCTCCTGCCTCAGCCT 
CCCGAGTAGCTGGGATTATAAGCATATACCACCACATCTGGCTAAI I I I IGTAI II MAG 
TAGAGATGGAGTTTCACCACATTGGCCAGGCTGGTCTCGAACTCCTGACCTCAGGTGATC 
[G,C] 

GCTCGCT TTGGC CTTCCAAAGTGCTGGGATTATAGGCGTGGGCCACTGCGCCAGTCTGAA 
AAACGTATTTTTAAG CACA TACTATCGTATCTTCTTGTCTTTTACCTGGAATTTAAGCTG 
GTTGTTTGTATTA CCTT TTCCATGGACATTTATATTTATAACCAATCAGAAGGTTTAAAT 
GTCAGTGTAGGAATTTTGTGCTATGGAAGCTTCGTGGCTTGGTGAATGGTAAAATGAATA 
ATGTGTGTATATTTGAAGCATCAGAAAGAGAAAATGCTGGGAAGATTCATAGAACCAGTT 

CCAC CTGTCTTCCTCTTCCTCTTTCCCTAGTCCTGCATATTGAAAAACA II II I M I II I 
TTTTGAGATGGAGTCTTGCTCTGCCACCCAGGCTGGAGTGCAGAGGCACGATCCTGGCTC 
ACTGCAACCTCTGCCTTCCAGGTTCAAGCAATTCTCCTGCCTCAGCCTCCCGAGTAGCTG 
GG ATTATAAG CATATA CCACC ACATCTGG CTAA II II I GTA I I I II AGTAGAGATGGAGT 
TTCACCACATTGGCCAGGCTGGTCTCGAACTCCTGACCTCAGGTGATCGGCTCGCTTTGG 
[C,T] 

CTTCCAAAGTGCTGGGATTATAGGCGTGGGCCACTGCGCCAGTCTGAAAAACGTA I II I I 
AA GCACA TACTATCGTATCTTCTTGTC1TTTACCTGGAATTTAAGCTGGTTGTTTGTATT 
A CCTT TTCCATGGACATTTATATTTATAACCAATCAGAAGGTTTAAATGTCAGTGTAGGA 
ATTTTGTGCTATGGAAGCTTCGTGGCTTGGTGAATGGTAAAATGAATAATGTGTGTATAT 
TTGAAGCATCAGAAAGAGAAAATGCTGGGAAGATTCATAGAACCAGTTAACATTTGAACT 

TAGTCAGATGGCCAAAAGCTGGGCACCCTTGCTGCCCCACTGCCAGTTTTGATATAGAGA 
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CATTGGTAGAGTAAACTGTACTTAGTAAGTTTTCCTAAATCTAAGTGAATATACAAATTA 
TATTGGAATAGATTGAGATTATCCCAAGATGATAAAGAGGTTAACCCCAGATTGTAGCAT 
GG ACTCCTGTCAGG ATGG AG ACTCCAGGACACTTGTTCCTG CTCTCCTAC C I I C I I I ATA 
TAAGTGTGAGATGCAAAGTTTTATTCCCATTAAAGTGAAGCAGA1TTCCTCTAAGTATCA 
[T,C] 

TGTATCCTTC CATTTTAG CACTTATCG CAGTTTATAATTATATTCACACACATAAATACA 
TACATGCATACATACAAATATATATACATGTGTGAGCACACCCCCACACACAAATATATA 
TAG ATTTG CGTGATGA TTTT GTCTCAACTGGACTGTAAGCATAATGAGGGCAGCCTGGGT 
TTGTTTTTGCTTATCATTTTATCCTTAGTGCCTGGTACCATAGTAGGTGCTTAATAAGTA 
CTTGTTGAAAAACTGGCTCTATGTGAGCTAAGGAACCACTCTTCTCTGTTTGGCAGATGC 

30772 CAGTCCCTACCGTTTATCTACTAAACTGGGCTTTCCTGGAGTGCCAAAACGGAAGGTGGC 
CATGTTAGTCATGAACAGCTCAGTTTCTGTTACAGAGACCCAAAATTACAGAGGTATAAC 
ATGCTAGAAACTTAACTTTCTTTCGCATCACAGTCCTGACCTAAGCAGGCAGAGCATGTA 
TGGTGGCCCCATGCTATCTTGGCCCAGGCTGCTTCTGTCACGTGGCTCCTCCATCCCCAA 
TTGTATGTTTCAAGATGGCTGCCACTTCCTGCTCATCACAGCCCAGAGGAGGGAGAAAAG 
[A,G] 

GAAGCAGAACCCTTAACCCCTCCACTAAGGCATAATCTGGAAGTTCACACATCACCTCTG 
TTCATATCATATAGGCAAGAACTTAGTCACCTGACCACACCCAGCTGCCAAGAAGGCCAC 
ATCTAGCTGCAAAGCAGGCCAAAATTTGAGAAATTCACTTGATGAAGTGATAGACAAGAG 
. TCAAGATAGTGATTAGTTCTACTAAAAGCACCTAAAGTTTGTGTGTTA I I I I I I CTAATG 
GTGTTTACCCTGGTCCAGTGCATCATGGTGCAAGCCAAGGTCCAGAACGATGGGTTTTAT 

36310 T AG AG TGTTTTGTTTTGTTTGTGTGTGTGT AGG CCTG CTGGTG G C AAATTCTTCG I I II I 
GTTTTCAGAA GATA AACCCTAATTATTGAAAGGTGGTTTTGTTGGGGATGTGATTCTAGA 
CTGACAGTTATTTTCTCTCAGAACTTTGAAGATGTCATTCCCCTTCTTTGTCTTCCATTG 
TTGCTGTCGAGGAGTTTGCTTTTAGCCTTATTATCTTCC I I I I GCAGGTGATCTCATTTT 
CTCTGGATGTTTTAAAGAC I I I I I I CTTTGCCTTTATGATTATGCAGTTTTCTCTAGGAG 
[T,G] 

TGTCCAGTGTGGA I I I CI I I I I ACTTACCCTGTTTGGTATATCTTGTGTTTCTTCCATTT 
GTGAATTCATGTCTTTCATCAGCCATTTTC MM! GAATATTGACTCTATTCTATTCTCT 
CTCTGTAGAGC TCCA ATGAAAGACTATTAGACCACATTCTTCTGTTATCCATTTCTCTTC 
TCTCCTTCATATTTTCCA1TTCCTTAACTTTCTGTG ATG CATTCTGGGTAA I MCI ICAG 
CTCATCTACCAGTTCTTTAAGTCTCTCTTAAACTATGTATTAGGTTGGTGCAAAAGTAAT 

36327 TTTGTGTGTGTGTAGGCCTGCTGGTGGCAAATTCTTCG I M I I GTTTTCAGAAGATAAAC 

CCTAATTATTGAAAGGTGGTTTTGTTGGGGATGTGATTCTAGACTGACAGTTATTTTCTC 
TC AGAA CTTTGAAGATGTCATTCCCCTTCTTTGTCTTCCATTGTTGCTGTCGAGGAGTTT 
G CTTTTAG CCTT ATT ATCTTC CTTTTG C AG GTG ATCTCATTTTCTCTG G ATGTTTT AAAG 
AC II I I II C I I I GCCTTTATGATTATGCAGTTTTCTCTAGGAGTTGTCCAGTGTGGATTT 
[T,C] 

Mill ACTI^CCCTGTTTGGTATATCTTGTGTTTCTTCCATTTGTGAATTCATGTCITTC 
ATCAGCCATTTTC Mill GAATATTGACTCTATTCTATTCTCTCTCTGTAGAGCTCCAAT 
GAAAGACTATTAGACCACATTCTTCTGTTATCCATTTCTCTTCTCTC^ 
ATTTCCTTAACTTTCTGTGATGCATTCTGGGTAATTTCTTCAGCTCATCTACCAGTTCTT 
TAAGTCTCTCTTAAA CTATGT ATTAG GTTG GTG CAAAAGTAATTG C AG I I I I I GCCATTA 

40618 CCGGGTGTGGTGGCTCACACCTGTAATTCCAGCACTTTGGAAGGCTGAGGCAGGCGGATC 
AATTGAGGCCAAGAGTTTGAGACCAGCCTGGCGAACATGGTGAAACCCTGTCTGTACTAA 
AAATACAAAAATTAGCCAGGCATGCTGGTGCATGCCTGTAATCCCAGTTACTCAGGAGGC 
TGAGGCAGGAGAATCACTTGAACCTGGGAGGTGGAGGTTGCAGTGAGCTGAGATTGCACC 
ACTGCACTCCAGCCTGAGTGACAGAGTGAGACCCTGTCTCAAAAAAAAAGAAAAAAAAAA 
[T,C] 

TGGCAA TAAAAA CAACCTGTTGCTTGCTGGAGGAAAAACCTGCTTGCAAAGCTCAGTCTG 
ATATCATTTTTTAAACAAAACTCTAAGAACAAGCCAGTCAGTTAAGCTAAAACCAAATAT 
TTG ATTAT GAAAAGGG I I I I IGTATAI I I I I ACAGGATAAGATACAAATAAATTTCAGTC 
I II C I I M AATATGTATTTCTGTTCCCAAACCAGACACAAAGCAA I II II AAACTTGATC 
GTCAAGAAATCTGTTTTCTCCTACACAATCAATGAAAAGTAATCTAAACAGTGTTTGTCA 

40928 AAACAACCTGTTGCTTGCTGGAGGAAAAACCTGCTTGCAAAGCTOVGTCTGATATCATTT 
TTTAAACAAAACTCTAAGAACAAGCCAGTCAGTTAAGCTAAAACCAAATATTTGATTATG 
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AAAAGGG I I I I IGTATAI I I I I ACAGGATAAGATACAAATAAATTTCAGTC I I I L I I I I A 
ATA TGTAT TTCTGTTCCCAAACCAGACACAAAGCAA Mill AAACTTGATCGTCAAGAAA 
TCTGTTTTCTCCTACACAATCAATGAAAAGTAATCTAAACAGTGTTTGTCAGGCCAGGCA 
[T,C] 

AGTGGCTCACATCTGTAGTCCTAGCATTTTGGGAGGCCTAGGCAGGTAGATTGCTTGAGC 
CCAGAATTTCAAGACCAGCCTGGACAACATGGCGAAACCCCATCTGTATTAAAAAAAAAA 
AAAAAAAAAGACCATATGTCTGCAGTCAGATGGAAAAAGTAAAAATATGTAATAAACACA 
TATGAATAATATTAAG GACC ATATTTTAAAATAAACTTGATAATAAATTTTTAATAATAT 
TATCTACG ATAAAATGTTTTACTTAAATTTCGTTCTTTATC ATG CCACAC AAAAATGG CA 

41044 TATGAAAAGGGTTTTTGTATATTTTTACAGGATAAGATACAAATAAATT^ 

TTTAATATGTATTTCTGTTCCCAAACCAGACACAAAGCAATTTTTAAACTTGATCGTCAA 
GAAATCTGTTTTCTCCTACACAATCAATGAAAAGTAATCTAAACAGTGTTTGTCAGGCCA 
GGCACAGTGGCTCACATCTGTAGTCCTAGCATTTTGGGAGGCCTAGGCAGGTAGATTGCT 
TGAGCCCAGAATTTCAAGACCAGCCTGGACAACATGGCGAAACCCCATCTGTATTAAAAA 
[A,-] 

AAAAAAAAAAAAAG AC CATATGTCTG CAGTCAG ATGG AAAAAGTAAAAATATGTAATAAA 
CACATATGAATAATATTAAGGACCATATTTTAAAATAAACTTGATAATAAATTTTTAATA 
ATATTATCTACGATAAAATGTTTTACTTAAATTTCGTTCTTTATCATGCCACACAAAAAT 
GGCAAAATGATTAAGAGAGTTTGCAAAATTATGTGGTATAGTGAAAGAGGTTTGCGGTTA 
AAAAAAAAAAAGAGAGAGAGAGAGAGAAGTATGGGGCCATGGGGATAGTCTCTGTAATCA 

41311 GACAACATGGCGAAACCCCATCTGTATTAAAAAAAAAAAAAAAAAAAG AC CATATGTCTG 

CAGTCAGATGGAAAAAGTAAAAATATGTAATAAACACATATGAATAATATTAAGGACCAT 
ATTTTAAAATAAACTTGATAATAAA I I I I I AATAATATTATCTACGATAAAATGTTTTAC 
TTAAATTTCGTTCTTTATCATGCCACACAAAAATGGCAAAATGATTAAGAGAGTTTGCAA 
AATTATGTGGTATAGTGAAAGAGGTTTGCGGTTAAAAAAAAAAAAGAGAGAGAGAGAGAG 
[" »G f A] 

AGTATGGGGCCATGGGGATAGTCTCTGTAATCAGTCACCTGAACCAC I I I I AATACTCAA 
AAGACTTATGAGAATAAAAATCTGA I I I I I GCTAAGATTTATTAGCAAAATAAATCTTAC 
TCCTTCCTGTCCCTCTCTAATTATCCTTCAGCTTGACCATGTATGAAAGAAAATTTACAT 
TTCACTGTTTAATCTATTTAAAGATGAACATTTCCCATTAAATCAGGATGCACCTTATAA 
TCAGTAGCATCTAACAATATAAGTCAGCCAGGCTGCAGTTGTGACTGTAGTTAGAATTGC 

41313 CAACATGGCGAAACCCCATCTGTATTAAAAAAAAAAAAAAAAAAAGACCATATGTCTGCA 
GTCAGATGGAAAAAGTAAAAATATGTAATAAACACATATGAATAATATTAAGGACCATAT 
TTT AAA AT AAA CTTG ATAATAAA I I I I I AAT AATATT ATCT A CG AT AAAATGTTTTACTT 
AAATTTCGTTCTTTATCATG CCACAC AAAAATGG CAAAATG ATT AAG AG AGTTTGCAAAA 
TTATGTGGTATAGTGAAAGAGGTTTGCGGTTAAAAAAAAAAAAGAGAGAGAGAGAGAGAA 
[-,A,G] 

TATGGGGCCATGGGGATAGTCTCTGTAATCAGTCACCTGAACCAC I I I I AATACTCAAAA 
GACTTATGAGAATAAAAATCTGA I I I I I GCTAAGATTTATTAGCAAAATAAATCTT ACTC 
CTTCCTGTCC CTCTCTAATTATCCTTCAG CTTG ACCATGT ATG AAAG AAAATTTACATTT 
C ACTGTTTAATCT ATTTAAAG ATG AACATTTCCCATTAAATCAG G ATGCA C CTTATAATC 
AGTAG C ATCT AACAAT ATAAGTC AG C C AGG CTG C AGTTGTG ACTGT AGTT AG AATTG CAC 

44701 TCTAAAAACTTTCTTGCAAGACAGAGCAATGCTATCTTCACATTATGTTATTGGGTGCTA 
TAACATCATCTAAG CTGG AGACAGCCTACTGTCATAGCTTTGGAGTCCAAAGACCTGGGT 
TTGAATTCTAACCATTTTCTAGCTAAATGAACATGGGCAAGTTATGTAGTCCCTCTGAAC 
TTTCGTTTCCTTGTCTGTAAAATGGCAACAATGATAATAAGGACTTTCTAATTCTTTATT 
GAGAATTCCATAAAAACAAATGCATAACAAGCTCCATGCACCATAAATGCTCAATAGATG 
[C,A] 

TTGCTTTCTTCCTGTCCCATACAAATTGTTGTACAGATGTTTCAATAACCTAACTGCTAG 
CAAGTATTACCTGAAATTTAACCCGATTGTTCTCTTCITTCACTTAGCAGTATTATTTCT 
TGTCCA CAATA GAGGAAGCACAATTGCAGTTCTGATGCTGCAATGACL I I I I ATACATTT 
GAAGAG I I I I I CCTGGTCATTTAATCAGGAAACAACACTTACTCACCATATATGAGGCGA 
GTAACTCTACAAGACTCTACAAGGTCTTGTAAGAAGCTATAAGCCAAGGGGGAAAAAAAA 

46020 A AATTT CAAAT AA C ATTTT AA AATATG A CAT A CT ATCTTTG AATG A C C A C A C AATTTAAA 
AAGCAATCATTTTACGGTTCTTTAGTGTTCAGTTAGCACAGCACTTAGAAATCATAGAAT 
AAAGTGAGCAAGATGCTTCTCAAAGCCTGATCACTCTTTAGGACTCACAATGGGCTAGGT 
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ACTATGCTGGAAAGAGAAAAAATAATAATTTTCTAACCTGCTTGAGACATAGTGGTATAA 
ATGATAACACAGCTGCTGAACGTGATGACTTTCTCACTTTGTCCGCAGAGCAAGAAACTA 
[T,C] 

AGATGCAGTAACAAAACTGCATTCAATGAACATGGGACTGTAGATAACAAACTAACTTCA 
TT TCTT TGGGTACATGCCCTGTATTGGGATTGCTGGATCATATGGTAGTTCCAI I I I IAA 
TATTTTGAGGAACCTCCATACCATCTTCCATAATGGCTGTGCTATTTGCATGCCCACCAT 
CAGTGTGCAAATGCTCCCTTTCCTCCACATTCTTGCCAACACCTCTTTCATC I I I I I GAT 
AATAGTTATGAGGCAATATCTCACCATGGTCCTAGACTTCATTTGTCTGATGACTAATGA 

46036 TTT AAAAT ATGACATACTATCTTTGAATGACCACACAATTTAAAAAGCAATCATTTTACG 
GTTCTTTAGTGTTCAGTTAGCACAGCACTTAGAAATCATAGAATAAAGTGAGCAAGATGC 
TTCTCAAAGCCTGATCACTCTTTAGGACTCACAATGGGCTAGGTACTATGCTGGAAAGAG 
AAAAAATAATAATTTTCTAACCTGCTTGAGACATAGTGGTATAAATGATAACZACAGCTGC 
TGAACGTGATGACTTTCTCACTTTGTCCGCAGAGCAAGAAACTATAGATGCAGTAACAAA 
[A,G] 

CTGCATTCAATGAACATGGGACTGTAGATAACAAACTAACTTCATTTCTTTGGGTACATG 
CCCTGTATTGGGATTGCTGGATCATATGGTAGTTCCA I I I I I AATATTTTGAGGAACCTC 
C ATACCATCTTCCATAATG G CTGTGCTATTTGCATG CCCAC CATC AGTGTG CAAATG CTC 
CCTTTCCTCCACATTCTTGCCAACACCTCTTTCATC I I I I I G ATAATAGTTATG AGG CAA 
TATCTCACCATGGTCCTAGACTTCATTTGTCTGATGACTAATGATATTGAGCA I I I I I IC 

46095 GGTTCTTTAGTGTTCAGTTAGCACAGCACTTAGAAATCATAGAATAAAGTGAGCAAGATG 
CTTCTCAAAGCCT GATC ACTCTTTAGGACTCACAATGGGCTAGGTACTATGCTGGAAAGA 
GAAAAAATAATAATTTTCTAACCTGCTTGAGACATAGTGGTATAAATGATAACACAGCTG 
CTGAACGTGATGACTTTCTCACTTTGTCCGCAGAGCAAGAAACTATAGATGCAGTAACAA 
AACTGCATTCAATGAACATGGGACTGTAGATAACAAACTAACTTCATTTCTTTGGGTACA 
[T,A] 

GCCCTGTATTGGGATTGCTGGATCATATGGTAGTTCCA I I I I I AATATTTTGAGGAACCT 
C CATACCATCTTCCATAATGG CTGTGCTATTTG CATGC CCACCATC AGTGTG CAAATG CT 
CCCTTTCCTCCACATTCTTGCCAACACCTCTTTCATC I I I I I G ATAATAGTTATG AGG CA 
ATATCTCACCATGGTCCTAGACTTCATTTGTCTGATGACTAATGATATTGAGCA I I I I I I 
CATATATCTCTTGGCCATTTGTAGGTCATCTTTTGAGAAATGTGTATTGAGGTTCTTAGT 

47608 GTTAGCCAGGCTGATCTCGAACTCTCGACTTCTGGTGATCCACCTGCCTCAGCCTCCCAA 
AGTGCTGAGATTACAGGCGTGAGCCACCGTGCCCGGCCCTTTGCCCACTGTTTAATGGGG 
TTGTCTTCTTG C TATT G AGTTCCTTATAT A I I I I I I ATATTAACCC CTTATCAAATGTAT 
GGCTTGCAAATATTTTCTCCCATCGTAGGTTGTCTCTTCACTCTAATGATTGTTTCCTTT 
G CTCTG AAG ACAC I I I I I AGTTTTATTTATTCCCATTTGTCTATTTTCACATTTGTTGCC 
[T.G] 

ATAAGCAGGTTAGAAAATTATACAGATTATAAATAGTTCCTGAATTTGTGTTTTACTAAA 
CGTAGCCTACACAGATGAAAACAGGAAAGCTACACTTCAGAATCTGTGATATTTGATGTC 
AGAAGTGCATCCCTGAAAGCAATGGGTCCATTCTAAATCTCCTAACCTCTAACCATAATT 
TGTTCTA TATTT ATC CTGAG ATCTCACTCTTAGGAATAAAAACACATTGAGAAGTCCTGA 
G TCTCT ATTTT A CT A I I I I I CTGAAGTGCCTGTAGTGTGTGTGTTTACATCTAAATAATA 

5 1949 TTA AAAT GTGATTAAAGAAGAGAATAAAATACATAGGGAGCTGGGGACTC I I I I I CTTAT 

I I I CI I I AGCATCCAATACTTTTGCTTACAGCTATCCATAGGGATCTGGCATCTTGAACC 
ACCAGGATTATGGAAGCCCTACAGCAAGCTAAAGACTAACTGTAAAGTCCTTTCAGCTGC 
TTTGTG AATG GTTATATCTATTG CTAAAAGG CCTTAATATC ATTTG CAAATAGTTTATG A 
TTTCTAACTA I I I I I CTAGAGTTTAACACGTGAGAAAAATGCTACTCTCTGGTCACAGGA 
[-.C] 

TTAGAATAGTGCCTATTTCCATTGGTCTGAGATAGAGAAAAAAGAACAAGTTTCTTGTGG 
AGCCGTGGTCCAGTCTGCAAATTGCTCCTATCTCCAGTTGCCATGGTTTCCAGGAGAACG 
TGGCTCTCATCTTTTCCTGCCCTGCCTGTACTTCTCCCTGTCCACTCTGTTCTCT ATTTT 
CCCTCAGCTTCCTAACTGAGGATGCCAGCAGAAGTTTAGAGTCACAGATGGATTGTAGGA 
AACAATTTGGATGATGCCAATACAAAGCTACTGTGGTGGGCATATGCTGCTCCCCCAAAC 

52150 TG CT AAAAG G C CTTAAT ATCATTTG CAAATAGTTTATG ATTTCT AA CT A I I I I I CTAGAG 

TTTAACACGTGAGAAAAATGCTACTCTCTGGTCACAGGACTTAGAATAGTGCCTATTTCC 
ATTGGTCTGAGATAGAGAAAAAAGAACAAG I I I L I I GTGGAGCCGTGGTCCAGTCTGCAA 
ATTGCTCCTATCTCCAGTTGCCATGGTTTCCAGGAGAACGTGGCTCTCATC I I I I CCTGC 
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CCTGCCTGTACTTCTCCCTGTCCACTCTGTTCTCTATTTTCCCTCAGCTTCCTAACTGAG 
[G,A] 

ATGCCAGCAGAAGTTTAGAGTCACAGATGGATTGTAGGAAACAATTTGGATGATGCCAAT 
ACAAAGCTACTGTGGTGGGCATATGCTGCTCCCCCAAACTTCAGACATTTGGGTTTCAGG 
TTGGTCCAGGCAATCAACAGTGATCCTTAATACAAAATGTCTTGGTGAGAGCAATAATCA 
AGAAACTTGGCCAAAGTGCTTCCCTGCCAGATTGTGTGCTTAATAAGATAACTGGGTTCC 
AATAAAACAGAGAAAATATGTTACA1TTTAAAAAATTTTCTGTTGTTTCAAAACAATGTG 

52426 TTTTCCCTCAGCITCCTAACTGAGGATGCCAGCAGAAGTTTAGAGTCACAGATGGATTGT 
AGGAAACAATTTGGATGATGCCAATACAAAGCTACTGTGGTGGGCATATGCTGCTCCCCC 
AAACTTCAGACATTTGGGTTTCAGGTTGGTCCAGGCAATCAACAGTGATCCTTAATACAA 
AATGTCTTGGTGAGAGCAATAATCAAGAAACTTGGCCAAAGTGCTTCCCTGCCAGATTGT 
GTGCTTAATAAGATAACTGGGTTCCAATAAAACAGAGAAAATATGTTACATTTTAAAAAA 
[-,T] 

TTTCTGTTGTTTCAAAACAATGTGCAGTTT1TCTATATAAGAAGAAAAGTCTCCAGGCCC 
AACATCCATAGGGCTCATCATCCATTG I I I I I CTTTTAAGTTTTCAATTTAATCCAAATA 
AGTCAAAAATTTTCAGGTACCTACTATCTGCCAGGTGCTGTGCCGTGCGCTGGGGCTACZA 
CAGATGGAGAGGGTGCATTCTTGGATCTCTAGTGTTTGGGTTTGGATTCATTCACCCACA 
CTCTTTCACCAGTTCTCT7TGTTACTGGGGTGCTCATTTGTGAGCCCTGCTTCCATGGCT 



Additional SNPs 3' of the ORF (DNA positions refer to the genomic 
sequence provided in U.S. Serial No.. 60/265,151, Attorney Docket 
NO.CL001098-PROV, filed January 31, 2001): 

56707 CTGCCAAGATCGATGCTGGGTGGAGTAGCCCGATTCTACAGATGAGGAAACAGAGGCTTT 
CTCCTAGGCTCACAGAGAGGTGTATACTTATTGGAAATGGCTGAAATCTGCATCCAAACC 
TTGTTCTCCTTTCATTAATCATGTTGTATCCCTCTGTCTCATTTTGCAGGATGAGTCTGA 
AAAATAAATTATATT G AATTG CA TCTG CTATAGTG C CTAACTTTTAGTAGGTACTCAATA 
AAAGTTCCCATTCAG I I I 1. 1 I I I G I I GGTTTG I I I I I I I I I I I I I I U I I I I I M I I I I I 
[-,T] 

TTTAGATGAAGTCTCACTTGTCACCCAGGCTGGAGTGCAGTGGCATGATCTCGGCTCACC 
GCAGCCTCCGCCTCCTGGGTATAAGCGGTTCTCCTGCTTCAGCCTCCCGAGTAGGTGGGA 
ATACAGGCGCACGCCACTGAACCCAGCTAA I I I 1 IGTAI I I I I AGTAGAGACAGGGTTTC 
ACCATGTTGGCTAGGCTGGTCTCGAACTCCTGACCTCAAGTGATCTACCTGCCTCAGCCT 
CCCAAAGTGCTGGGATTACAGGTGTGAGCCACTGCCTCCATTTGG I MM I ACACAGTGT 

57444 CTTTTG TTTT TACTTG(^TGGCCTAAACCTTGCCrG 

TCTG AGTTTTC AAATG CCT AC CTAAGTCAGTTGTGTG CC CGTT AAG G CTGTTCTT ATTTG 
GTGATGGGTCCATTCTGCTCCCCACACCTAGACTGCAGTAATATGTCTGCCCACTCTGTA 
CCCACAGGGCCTTTCCCTGGGAGAATCCTTGTCATCTACATGCTACATAATGGGTATCCA 
GTAAGGGCTACTAAATCACAGACAACATTAGTGTTACCATTGTTAATTTCACCCAAACAT 
[A,G] 

GTATA GTGG AAAGAGTGCTGAAGTTTTCTACAGAGCACACCTGACCACTTCTAGTGCTTC 
TAGTATTTTACACAAGCCCCCTACCTGGCACATAATAAGTAGTCAGTAGATATTTGTGAA 
ATGAAATCAATGACTAAGAGTTAGAAAACCTGAGTTCTAGTTCCAATTGCACCCTCTTGG 
ACTCTAAATAATTCTTATTGTCTGCTTTGCAAAGAAGAACATGTAAGTTATTATTCACCT 
TAATAATGGTAATAATAATTG AGG ACTAACTATG AG CTAG A CACTTTACATATTTTACTT 

58021 CTAGACACTTTACATATTTTACTTTACTTTGATTCA 

ATGTTGTAAACCCTACTTTACTGATGAGAGAGCTGAGGCTTTGAGAGAGGTATGTGACTT 
GCTTAGGGTCACCTAACCAGGAAGGGGGTGACCTAGAATATCCACACCTGTTGGACTCCA 
AGTGGGTCGTAGCCGTATTCCACTTTACCTTAATTATCAAAGGGCAG AT ATAATG ATTTG 
GTAATTAAAACAGTGCCCTAGACTGGGGACTGGCCATAAACTTTTGCAGGTTCATTTCCA 
[A,G] 

ATGATTATGTCTTTCCACCCAGACGTGATTTTAGAATCTGTGAGCCTGGCGACATATTTC 
AGGAGTGTAATCTGAGTGGAGTTCCTACCCTGGGACTCCAGCAGCCTCCCTCGCTACAGC 
TAATGAAGCCACrGGCGTTCACTGAATCTATGAACCTCGATTGGGCTCATTACTGTCTCA 
CTTCTGGAGCGCTGCTGAAAGCTCCTGGGGAGTTACTATTAACATCTTTATTAACCCGTC 
CCCCAAATCAGTCAGTGAAATTAATTTAGATACACAGCCACTTTGATCTTCATTAATGGT 

58064 CTTATCAGATAGAGACAATGTTGTAAACCCTACTTTACTGATGAGAGAGCTGAGGCTTTG 
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AGAGAGGTATGTGACTTGCTTAGGGTCACCTAACCAGGAAGGGGGTGACCTAGAATATCC 
ACACCTGTTGGACTCCAAGTGGGTCGTAGCCGTATTCCACTTTACCTTAATTATCAAAGG 
GCAGATATAATGATTTGGTAATTAAAACAGTGCCCTAGACTGGGGACTGGCCATAAACTT 
TTGCAGGTTCATTTCCAAATGATTATGTCTTTCCACCCAGACGTGATTTTAGAATCTGTG 
[A.C] 

GCCTGGCGACATATTTCAGGAGTGTAATCTGAGTGGAGTTCCTACCCTGGGACTCCAGCA 
GCCTCCCTCGCTACAGCTAATGAAGCCACTGGCGTTCACTGAATCTATGAACCTCGATTG 
GGCTCATTACTGTCTCACTTCTGGAGCGCTGCTGAAAGCTCCTGGGGAGTTACTATTAAC 
ATCTTTATTAACCCGTCCCCCAAATCAGTCAGTGAAATTAATTTAGATACACAGCCACTT 
TGATCTTCATTAATGGTTGCTTCTAGTTTCTACATTTA I I I I I AGTTTTCTCATCTTTAA 

59067 CTGTAATCCCACCACTTTGGGAGGCCGAGGCGGGTGGTTCACTTGAGATCAGGAGTTCGA 
GATCAGCCTGGCCAACATGGTGAAACCCTGTCTCTACTAAAAATACAAAAATTAGCCAGG 
TGTGGTGGCAGGCACCTGTAATCCCAGCTACTCAGGAGACTGAGGCAGGAGAATCACTTG 
AACCCAGGAGGCAGAGGCTGCAGTGAGCCAAGATCATGCCGCTGCACTCTAGCCTGGGCA 
ACAGAGTGAGATTCCATCTCAGAAAAACAACAACAACAACAAAAGAAACACACATAAACA 
[T,C,G] 

AGCAGGTATACAAGTGGGCTCACCTGAGTTACTTTAACG Mill AAAGTAAGGTTTAATA 
TATTTATTTCA I I II IAI I I I I CATCCAACCATCTACTTCTTATTTATTCAGCAAATATG 
TATTGAATGCTAGGTACTAGGGACTGAGAGAGTAAAAAATAAATATGACACTGGTACCTG 
CCTTCCAGGAATATAGTCTAGAGGGGAAAATAAATAAATAATACAATGTGGAAAGTATAC 
TACAGCAGCATTGCCTGGGGATCTTTAAGAAACGAGGGAAATCTTGGACTTGGCTGTGAT 

60034 . AGCCACCGTGCCTGCCCTAGGGATGG I I I I I AAAGAAAGAGATGTGAGCTTGGCCTCCCA 
AAGTGCCGGGATTACAGGCATGAGCCACTGTGCCCGGCCACCCCTGCTATTTTATTCAAA 
GGAA ACAGAAGGGGGAAGGAACAGATTTATAGAGAAGGTAATGAGTATAAGGTTTTAACA 
TTTTGAGATTAAGGTATCTGTGGAACCACAAAATGGAGATATCTAAAAAGCAAGTAGAAA 
TATAAACTGTGAGTACAAAAGTCCAGCCTAGAAATACAGAAGTCACATGCTTAAATCGGT 
[A,G] 

TTTAAAGCCTCAGGTGTGAATTTAGTTGCCCAAAGGAGAGAGTGTAGAGAAAGAAGAAAG 
GACAATGAGGGAACCCTGTGGGGCCAGTACTGCAGGACTGGGTAGAGGGAAGGGAGTCTT 
TAGAAGCTGAGAAGGAACAGCCAGAGGAGCTAAAGGAGACTAGTAGGTAATCCTAGCAAT 
ACCATGGAAATCATATGGCCAGCACTCACTGGAAGAGGGCTGGGTTCCATGCCTGTGTAT 
TCTCAATGCTGAGCTCATTGGCTGCATGTGGTGGCCCTTCAATATTTGTGGAGACTACAA 

63709 TACTTTCACAACCCTATGCAGTAGAAACTATTACTATCCCTCTTCTGCAGGTGGGAAAAC 
TGAGGCCCAGTGAGGTTAAGCAATATCTTCCGGTCACACAGCTGGTAAGAGGCATAGTGT 
GGGTTCACCCGGGTCCTGGGTCCAAGTGCCTGGCCACGCATTGTTTATACTGCCCATAAC 
TTTTCAAAGGAAGGGAAAGAGGAAGGAAAAGCAGATAGGATTTATTTTGTTTAATGGTCC 
AGAGCAGGCATTTTGCACTCATTATTTCATTGGTTCGCTTCAGGTGAATACCACCTTCAA 

[A,T] 

TTGCATTGAGAAAACTGAGGGTTGCCCCTACAGAATGCGAGAAAA I I I I I GCAATCTACT 
CATCTGACAAAGGGCTAATATCCAGAATCTACAAAGAACTTAAATAAGTTTACAAAAAAA 
AAAAACCATCAAAAAGTAGGCAAAGGATATGAACAGACACTTCTCAAAAGAAGACATTTA 
TGCAGCCAACAGACACATGAAAAAATGTTCATCATCACTGGTCATCAGAGAAATGCAAAT 
CAAAACCACAATGAGATACCATCTCCCACCAGTTAGAATGGCAATCATTAAAAAGTCAGG 

63817 GAGGCATAGTGTGGGTTCACCCGGGTCCTGGGTCCAAGTGCCTGGCCACGCATTGTTTAT 
ACTGCCCATAACTTTTCAAAGGAAGGGAAAGAGGAAGGAAAAGCAGATAGGATTTATTTT 
GTTTAATGGTCCAGAGCAGGCATTTTGCACTCATTATTTCATTGGTTCGCTTCAGGTGAA 
TACCACC!TTCAAATTGCATTGAGAAAACTGAGGGTTGCCCCTACAGAATGCGAGAAAATT 
TTTGCAATCTACTCATCTGACAAAGGGCTAATATCCAGAATCTACAAAGAACTTAAATAA 
[G,A] 

TTTACAAAAAAAAAAAACCATCAAAAAGTAGGCAAAGGATATGAACAGACACTTCTCAAA 
AGAAGACATTTATGCAGCCAACAGACACATGAAAAAATGTTCATCATCACTGGTCATCAG 
AGAAATGCAAATCAAAACCACAATGAGATACCATCTCCCACCAGTTAGAATGGCAATCAT 
TAAAAAGTCAGGAAACAACAAATGCTGGAGAGGATGTGGAGAAATAGGAATGCTTATACA 
CTGTTGGTGGGAGTGTAAACTAGTTCAACCATTGTGGAAGACAGTGTGGCGATTCCTCAA 

64845 TAGGAGAAATAGCTAATGTAAATGATGAGTTAATGGGTGCAGCAAACCAACATGGCGCAT 
GTATACCTGTGTAACAAACCTGCAGGTTGTGCACCTGTACCCTAGAACTTAAAGTACAAT 
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AATGAAAAAAATTAAATTAAAAAAATCAGAAAAAAAAAGAAAGAAAACTGAGGGTTGCCA 
TGGATGGTCAAAGTATCTTACATGAGGTCTCAGCAAAACTGAAGCAGCAGAGTCTGTATT 
GAAACCCTAAGTCAGCTAACCCCCAACCCACGTGCCACAAGAACCCCACTCAAGTTAGAA 



GAGTATTAAAATATTGGAGGCCAAAAACACAGCACATAGACCTCTCAGAAGGGAAAAATA 
GAAAATGGGAA AGGA TG GAAA TGGATTTCATTTTGTTGATTGAAATTGGGAGATGACTGA 
ATGAGGAAATATTTTGATTTTGTTTGGCCAGGGAAGAGCTGTAAATAGGCAGCAGAAAGT 
GGTTTGAAGAGAAACATACTAAACAAATGCCAGATATTCTTATCTTTACTGGTTATTTT 
AACTTAGGCAAAATCATACTGAAC^GAGAGTAAATAACCTGTATAGACAATTGCTTCTCT 

GAGAAATAGCTAATGTAAATGATGAGTTAATGGGTGCAGCAAACCAACATGGCGCATGTA 
TACCTGTGTAACAAACCTGCAGGTTGTGCACCTGTACCCTAGAACTTAAAGTACAATAAT 
GAAAAAAATTAAATTAAAAAAATCAGAAAAAAAAAGAAAGAAAACTGAGGGTTGCCATGG 
ATGGTCAAAGTATCTTACATGAGGTCTCAGCAAAACTGAAGCAGCAGAGTCTGTATTGAA 
ACCCTAAGTCAGCTAACCCCCAACCCACGTGCCACAAGAACCCCACTCAAGTTAGAAAGA 



TATTAAAATATTGGAGGCCAAAAACACAGCACATAGACCTCTCAGAAGGGAAAAATAGAA 
AATGGGAAAGGATGGAAATGGATTTCATTTTGTTGATTGAAATTGGGAGATGACTGAATG 
AGGAAATATTTTGATTTTGTTTGGCCAGGGAAGAGCTGTAAATAGGCAGCAGAAAGTGGT 
TTGAAGAGAAACATACTAAACAAATGCCAGATATTCTTATCTTTACTGGTTATTTTGAAC 
TTAGGCAAAATCATACTGAACAGAGAGTAAATAACCTGTATAGACAATTGCTTCTCTTAG 

TGTAACAAACCTGCAGGTTGTGCACCTGTACCCTAGAACTTAAAGTACAATAATGAAAAA 
AATTAAATTAAAAAAATCAGAAAAAAAAAGAAAGAAAACTGAGGGTTGCCATGGATGGTC 
AAAGTATCTTACATGAGGTCTCAGCAAAACTGAAGCAGCAGAGTCTGTATTGAAACCCTA 
AGTCAGCTAACCCCCAACCCACGTGCCACAAGAACCCCACTCAAGTTAGAAAGAGTATTA 
AAATATTGGAGGCCAAAAACACAGCACATAGACCTCTCAGAAGGGAAAAATAGAAAATGG 
[G,T] 

AA AGGA TG GAAA TGGATTTCA1TTTGTTGATTGAAATTGGGAGATGACTGAATGAGGAAA 
TATTTTGATTTTGTTTGGCCAGGGAAGAGCTGTAAATAGGCAGCAGAAAGTGGTTTGAAG 
AGAAACATACTAAACAAATGCCAGATATTCTTATCTTTACTGGTTATTTTGAACTTAGGC 
AAAATCATACTGAACAGAGAGTAAATAACCTGTATAGACAATTGCTTCTCTTAGTGCCCA 
CCTGGGACrATAAAATGCCAGCAGAGAGGTCCACATTTGATTGCGCCTGACCTTGAAAAC 

AGGAGGATGTGAGTCAAATTGCATTAGGGCCCACCCTAATGCTTTCATCTTAACTAATTA 
CATCTACAATGA CCCT ATTTCCATATAAGATCATATTCTAAAGTAATTGGGGTTAGGACT 
TCAACATGTGAATTTTGGGAGCACATAATTCAACCCATAATAGTGACCTTTCACCAAATC 
ATAAAAATTCATCAGGAGATGAGGCTTTAAAAATCACATTAGCCTACCTGATACTTGAAT 
CAI I I I I I I AAACCAAGCCAAGAAGAGCATTTAGAATTTTAACAGTATATTTGGCAACAG 
[ G , A] 

GTTTTCGGGTGGATTTTA I I I I I I AACGCCCTCTGTATGCTTCCCAGAATGGTTCCCACT 
GCCTACGCCTCGAAGCCTTCATGTTGTCTGGAACAGGTGAGTACTACCTCAGGAAGGGAT 
CTTTAAGGGTTCTTTAAGCAGGATTGGAGAGACATTTCCCTGGATCTCAGTCCACTGAAC 
AGCAGCCCCCGAGCACTTCCATGTGGGGGCTCTAAGCTGTAGGAAGATGCCTCTGCAAGC 
GCCAGACCCCTGAGAGTCTGTTAA I I I I I I I CTATGAACCATTTTACTTTCAGTGAGTTT 

AATTTTGGGAGCACATAATTCAACCCATAATAGTGACCTTTCACCAAATCATAAAAATTC 
ATCAGGAGATGAGGCrTTAAAAATCACATTAGCCTACCTGATACTTGAATCA I I I I I M A 
AACC AAGC CAAGAAGAGCATTTAGAATTTTAACAGTATATTTGGCAACAGGGTTTTCGGG 
TGGATTTTA I I I I I I AACGCCCTCTGTATGCTTCCCAGAATGGTTCCCACTGCCTACGCC 
TCGAAGCCTTCATGTTGTCTGGAACAGGTGAGTACTACCTCAGGAAGGGATCTTTAAGGG 



TCITTAAGCAGGATTGGAGAGACATTTCCCTGGATCTCAGTCCACTGAACAGCAGCCCCC 
GAGCACTTCCATGTGGGGGCTCTAAGCTGTAGGAAGATGCCTCTGCAAGCGCCAGACCCC 
TGAG AGTCT GTTAA I I I I I I I CTATGAACCATTTTACTTTCAGTGAGTTTGGTCGTTAAA 
ATTGTTTTGTGTCCCTCAGCCATGCCCCAGGCCCTGAGAACGAGGGAGTGTTGGTCTGCA 
AGAAAACCTAGTGGGTTTATTATTCTCTGACACAGAGAAACCAAATAACATCATTGAGTG 

GGGAATAGTTAGCTAACTAAGCTGTGCCAGGCAACCTCCGGGCTAAGAAGAACTCAGTGT 
TTTCGGACAATGACCAATTACAATAACCAGTATTATTTGATCTGAGAGTAATTAGCCGAG 
GCTCTGTTC I I I I I GCTTCAGTGAGGAGGCAAAAAGGGCAATGAGGAAAACATCAGAGAC 



[A,-] 



[G,-] 



[T,A] 
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AGGGGAAACGAGCTCAAATGTCAGAGAAAAACACAGTCTTGCAGGTGGGGAGAAGTGGAA 
GAGTTTCACTGGCCAAGATCCTGACTGAACACTCGAACATTG I I I I I CCCTGAAAATATG 
EG,-] 

TAGAATTTAACTTAACCAAAAGTTGTTTGAATTCTTCACT 

AAAAGCCTCCACGTAGAATAAAATATCAGGGTACAAAGAGTAAAATAGGTTAGGAACATA 
GAACTATGGACTA CCAG AGATCTTTCACTGGGAGGGACAGGTATCTCCAGGACATCTGCA 
CCCTCCACTCTTATITTCAACAGCACTGTATAACCTGAAACCTTITGGAGACAGGTTCTA 
GGGAACTACTGTTTATCACTTCATTCAAGACGTGCAGTATAGTCAGGCTCCTTCCCCTCT 

TGAA1TCTTCACTCTTACTGTTCATTTCCTTTAAAAAGCCTCCACGTAGAATAAAATATC 
AGGGTACAAAGAGTAAAATAGGTTAGGAACATAGAACTATGGACTACCAGAGATCTTTCA 
CTGGGAGGGACAGGTATCTCCAGGACATCTGCACCCTCCACTCTTATTTTCAACAGCACT 
GTATAACCTGAAACCTTTTGGAGACAGGTTCTAGGGAACTACTGTTTATCACTTCATTCA 
AGACGTGCAGTATAGTCAGGCTCCTTCCCCTCTACCTAGAGATATGTGGATTTTAGCATG 
[C,G] 

CCCGTGGCTTTCTTTCATGTCACTATCAGCTTCCAAAGTAAATGGCAACTCTCAACCTGA 
GAACCATCTTAATACTCCATGCTGCTGCTGCTGCAGCTGCCCCTGTCCCATGGCAGAAGA 
TAGTTCACCAGCCTCCTGCTATCACCCCACCAGCCT1TCTTTCGAGGGCTGAGCAAGGCT 
TGAATTCTCTGAATACCTCTCTGGC I I I I CTGACGATATAGCACCCATGGCCTCTGCGCC 
TTCCCTTGTCCCCTAGCAATATGTATTCACCTTTTCTCAGACTTCTGGCTTCTCTGCTCA 

CTGTGGA TTTG GCTGTTGAATCAGGTTGCAGATGTGGCGAGTAACAAGCAGCCGAAGTGT 
CACTGGCTTTTCTACACAATCCCTTCAGAGGTGCAGTGGAACCATTCACAGGCCTGTAGA 
TGAATCCCCG GGGCT TGCTTGAGGTTTGATGGTGATCTCCTTAATGTGGGCCTAGTAGGT 
ACGCACTCACTTTTTAGATTGATTGGCAAATTGTAGGACAGCTTCTGCAATGTTTCTTCC 
TAGTATTGGATGTATTTCTGACTGTGGGCATCTACTACATCCCAAACTGAGAACATTTAA 
[G,A] 

AAGTGCCTACTCTATCCCTGCCCTGTTCCTCTTCCTGTAGGTCTAGGGGGAAAACCAAAG 
AACCCAGTTGAGCTCATACTGACCAGCCTGTCCTGCACCTCAGAGGTCAGTGGCAGTGCT 
GTGGGTGGGAGACTTTTATACTCAGAGCCAGGCTGTGCTAGGAAATCAAAACTGCTGTCC 
CAGACCCCTGAGAACAACCAGATTTTAGTAGTATGGAGTGATGGGTTAGTAGATGGTGTT 
TTACGCAGTCACAAAACTCAATTTTCA I I I I MCI I ATTGTTTGTGTTTCTATTCTGGAC 

TAGTTAACATCACAAACCTAAGAGTATGAATTCACATTTACCTTGGAAAGGTGCTGGTAG 
GCAAATTAACATTTGTGATGTTCA I I I I I I I ATGCCAATGTTTA I I I I I AAAGTAGGGGT 
TGTAACCAGTGGATAATTGTTCCAGCGGGAGTTAGAGGTGGTGGCCACATGTACAACTAT 
GACAGTAAAGGCAATTGGTAACAAGCAAGGCCATAGGTGACAGGAAAAAGCATAATGATA 
CAAACACGGAAGATGTCAGTAAATCAACATTTGTGCCCAAAAAGTAACATTTGTTTTCCC 
[A,G] 

G G ACTCTTTC CTC AG C CTT G C AAAAAG AG CTC AATGTG CG ATAG C AAGG CG G CG ATTACT 
GGGGCACTTGCCAAAGGCCCAGATCCTAATGGTCTTGCTACCAGTGCTGCCAGTGAAGGT 
TTGTGTTCATGTGGGGTGGAGTTGGCTTGTACAGGGGAGACCGACACACACTTGGCTCCT 
AGGAGGAAGAGACTTCACTTGCCCACTTTGCCCTGGTGACCACCTCTACCGGCGAAGGAG 
AAGGAGAAGCAAAGTGCAGCAGTCATAGTTCAGAGGCTGTGAGACTGGAACAAGTCCAGA 

AATTGGTAACAAG CAAGG CCATAGGTG ACAGG AAAAAG CAT AATGATACAAACACGGAAG 
ATGTCAGTAAATCAACATTTGTG CCCAAAAAGTAACATTTGTTTTCCCAGG ACTCTTTC C 
TCAGCCTTGCAAAAAGAGCTCAATGTGCGATAGCAAGGCGGCGATTACTGGGGCACTTGC 
CAAAGGCCCAGATCCTAATGGTCTTGCTACCAGTGCTGCCAGTGAAGGTTTGTGTTCATG 
TGGGGTGGAGTTGGCTTGTACAGGGGAGACCGACACACACTTGGCTCCTAGGAGGAAGAG 
[T,A] 

CTTCACTTGCCCACTTTGCCCTGGTGACCACCTCTACCGGCGAAGGAGAAGGAGAAGCAA 
AGTGCAGCAGTCATAGTTCAGAGGCTGTGAGACTGGAACAAGTCCAGAAACCACCAGAAG 
CCCTGAAACAACCCTGTGGGGAGGGTGAGGGTAGGTTCTGTTGGTTGAAATGGAAAATCC 
TAAAAAAGAGGTCCTTGCACTAATTAGTACCTACCCTTCTTTTATTCATTCACCTGGTTT 
AAAAAAAACAAAAACAAAAAC CTGTTCTTATTTTA II II I I I GAGACAGGGTCTAGTTCT 

CAAGG CCATAGGTG ACAGG AAAAAG C AT AATGATACAAACACGGAAGATGTCAGTAAATC 
AACATTTGTGCCCAAAAAGTAACATTTGTTTTCCCAGGACTCITTCCTCAGCCTTGCAAA 
AAGAGCTCAATGTGCGATAGCAAGGCGGCGATTACTGGGGCACTTGCCAAAGGCCCAGAT 
CCTAATGGTCTTGCTACCAGTGCTGCCAGTGAAGGTTTGTGTTCATGTGGGGTGGAGTTG 

FIGURE 3DD 



Docket No.: CL001098DIV II 
Serial No.: TO BE ASSIGNED 
Inventors: YAN, Chunhua et al. 
Title: ISOLATED HUMAN KINASE ... 



GCTTGTACAGGGGAGACCGACACACACTTGGCTCCTAGGAGGAAGAGACTTCACTTGCCC 
[A,G] 

CTTTGCCCTGGTGACCACCTCTACCGGCGAAGGAGAAGGAGAAGCAAAGTGCAGCAGTCA 
TAGTTCAGAGGCTGTGAGACTGGAACAAGTCCAGAAACCACCAGAAGCCCTGAAACAACC 
CTGTGGGGAGGGTGAGGGTAGGTTCTGTTGGTTGAAATGGAAAATCCTAAAAAAGAGGTC 
CTTG C A CT AATTAGT A CCTAC CCTTCTTTT ATTCATTCA C CTG GTTT AAA AAAAA C AAAA 
ACAAAAACCTGTTCTTATTTTA I I I I I I I GAGACAGGGTCTAGTTCTGTCACCCAGGCTA 

72590 TCTTGGCTTCCCCAGACACTCTACACAAACACAGTCTAAAT AAAAA CAATATGCCTGCCA 

TGCTAACACAAGTTTAAAACAGGTCTCCCAGCCTTCCCTGTCCAATCAGACTGCAGCCCC 
TCAGCCCTCACTACAGGGCAGTTCAGGAGCCATCTGTTGTGGGTTAATGCTGGGGGAGTA 
TTCGAGAGAACAAATCGCTGATTGGGGTGGGACTGTGGAGGTGGAAGTTGATAGATGCAT 
TGCTCCCCACCACCTACCCGCCCTCCCCATGTCCAGAAGTTCATATTTAACCAGAAAGCA 
[T,C] 

GAAGCCTGCAGAAAGTGGATTCAGGGCCAAATTTTGGCACAAACTTCACCTCACTACCCA 
ATAGTGTGGGTGGACCAGC!TAAAATTTGTTCATGAGATTTCCCCTGAGTCTCCTTCCAGA 
GAGATCCTAGGCTCTAGTGATTTGTACTAACATTCCTCTTCCAAACGTCCATTTCATGGG 
CCCTAAAAAACCTCTGCCTGGACAGGATACCCTGGATGAGGAATATATCTGCCAGTCCTT 
CTTC AG G C C A C AG C AAATG AGT AG ACTTCT ACACAATTCCTTTTTCTCCT AACTG G CCTG 

72757 TGCTGGGGGAGTATTCGAGAGAACAAATCGCTGATTGGGGTGGGACTGTGGAGGTGGAAG 
TTGATAGATGCATTGCTCCCCACCACCTACCCGCCCTCCCCATGTCCAGAAGTTCATATT 
TAACCAGAAAGCATGAAGCCTGCAGAAAGTGGATTCAGGGCCAAATTTTGGCACAAACTT 
CACCTCACTACCCAATAGTGTGGGTGGACCAGCTAAAATTTGTTCATGAGATTTCCCCTG 
AGTCTCC1TCCAGAGAGATCCTAGGCTCTAGTGATTTGTACTAACATTCCTCTTCCAAAC 
[G,C] 

TCCATTTCATGGGCCCTAAAAAACCTCTGCCTGGACAGGATACCCTGGATGAGGAATATA 
TCTGCCAGTCCTTCTTCAGGCCACAGCAAATGAGTAGACTTCTACACAATTCC I I I II CT 
C CTAACTG G C CTGG AAATTTG CAAGG ATTTGG AAATTTG CAAA INCH CTCTATCTTGC 
CTATGAAGCAATTATTCCTGAGTTTGAAAGTAGTTATCTGTTTGCAGAAACAGTTTCATG 
TCTTGGGAATCAAATAGCTCAAA I I I I I I AATGCAAATCTTATGCAAAAATTAGGTAAAT 

72863 CAGAAGTTCATATTTAACCAGAAAGCATGAAGCCTGCAGAAAGTGGATTCAGGGCCAAAT 
TTTGGCACAAACTTCACCTCACTACCCAATAGTGTGGGTGGACCAGCTAAAATTTGTTCA 
TGAGATTTCCCCTGAGTCTCCTTCCAGAGAGATCCTAGGCTCTAGTGATTTGTACTAACA 
TTCCTCTTCCAAACGTCCATTTCATGGGCCCTAAAAAACCTCTGCCTGGACAGGATACCC 
TGGATGAGGAATATATCTGCCAGTCCTTCTTCAGGCCACAGCAAATGAGTAGACTTCTAC 
[A,C] 

CAATTCCTTTTTCTCCTAA CTGG CCTG G AAATTTG CAAGG ATTTGG AAATTTG CAAATTT 
CTTCTCTATCTTGCCTATGAAGCAATTATTCCTGAGTTTGAAAGTAGTTATCTGTTTGCA 
GAAACAGTTTCATGTCTTGGGAATCAAATAGCTCAAA I I I I I I AATG CAAATCTTATG CA 
AAAATTAGGTAAATTAGAATTCTGGGCCAAGACTTCCAGCATCTCTGCATCATGGCACCT 
G G AAGG G AAATAATATCTG ATCTTTCTTG CTTTAAGTCAGTAGGTGTATTTG ACATTAAG 

74565 AAGATCAACTTGAGGATTTGTCCTTACTGTGATTTAAGTACTTAATTCTCTCTCTTCCTC 
TGCTCCATCTCAAGGGAACCAAGAGAGCTTCCTAACTAATAAGACTGGGTGAGCTTCAGA 
AATGCTGCAAACAAGTGTATTTTCTACCCAAGCTTCACCCTCCAGAATCACCTCTTTAGA 
ATTCTG TGCTTTT CCCCCTCCAGATTTCTGTGCTTGCCACCCTTGGAGTCCAGAGCCTTG 
AACATG I I I I I I GTTTGTTTGTTTGAGACAGAGTCTTGTTCTGTTGCCCAGGCTGGAGTG 
[C,T] 

TGTGGTGCAATATTAGCTCACTGCAGCCTCTACCTCCCAGGTGCAAGTGATTGTCATGCC 
TCAGC CTCCTGAGTAGCTGGGATTACAGGTGCATGCCACCACGCCCAGCTAA I I I I IATG 
TTTTTAGTAGAGACAGGGTTTCGCCATGTTGGCCAGTCTGGTCCCAAACTCCTGGCCTCA 
AGTGATCrGCCTACCTCAGCCTCTCAGAGTTCGATCGTATTTTGAATCAGGCCTTTCTAC 
TTGGTTGGGGCTGCAGTGTTCCTCCAAAAAGAATCCTTGATTATATATTTCTGTTTATAA 

74850 GCCCAGGCTGGAGTGCTGTGGTGCAATATTAGCTCACTGCAGCCTCTACCTCCCAGGTGC 
A AGTG ATTGTC^G CCTCAG CCTCCTG AGT AG CTGG G ATT ACAGGTG CATG CC ACC ACG C 
CCAGCTAA I I I I I ATG I I I I I AGTAGAGACAGGGTTTCGCCATGTTGGCCAGTCTGGTCC 
CAAACTCCTGGCCTCAAGTGATCTGCCTACCTCAGCCTCTCAGAGTTCGATCGTATTTTG 
AATCAG G CCTTTCTACTTGGTTGGGG CTG CAGTGTTCCTCCAAAAAGAATC CTTG ATT 

FIGURE 3EE 



Docket No.: CL001098DIV II 
Serial No.: TO BE ASSIGNED 
Inventors: YAN, Chunhua et al. 
Title: ISOLATED HUMAN KINASE ... 



[A,G] 

TATTTCTGTTTATAATTATATTTCTGTGTCTTTTATGTGTACATGGTGCCCAGACACACG 
GTGGAGGGCAAGGGAGCTGAGTGATAGAATGAGGAGA I I I I I CCCCTTCAACTTAGTAAC 
AGGACAGTCAGTAAGAGTTTAGTATGGGGGTTAC I I I I CCAGTTGCCATCAACTTTAGCA 
ATTCCAGG AAAATTCTACTTAAAATCTAAGTCTAG CCATCTCC I I I I C I I I AC I I II I GT 
TTTTATCCCTTTG I I I I I CAGAAGGGAGGCTGCTGGAGAAACTCTA I I I I I CTCTCCCAC 

75462 GGGCAGGGTTCCTGAAGCZTGAAGACCTGGCCTCCCAGATGCTAAAAGGCTTTCCCAGAGA 
CCGCGTCTCCGCCCAGGAAGCACTTGTTCATGA1TATTTCAGCGCCCTGCCATCTCAGCT 
GTACCAGCTTCCTGATGGTGAGCGAGGGAGTGTGTGCGTGTGCGTGAGTGCATGTGCGTG 
AGTGCGTGTGTGTGTAAGTCTTGGTGTCTTAAGTAGTTTGCCTCAGCACCGGAGAATCAT 
AGCATTTACCCCCAGGAGTGAAGTTAGAGATCAGTTTGCCTGGGAGAAGCAAGATAGGAC 
[G,C] 

TAATCTTGTCTG AGTTTTGCATG AG CTTCTCTCAAACTCTG AAGCACTGGTGGGG AG AGG 
TTACAAGGACCTCTTTGTAATGGTCCAGACATTTCACAGGTACACATTTACACTCAGA 
TTGGATGAGATGGACGTTGTCTCrGAGTTGTTTGCCTTAGCTACCATCTGCCAGGAAGCC 
ACACCAGTGTTTCTAACAAGACTCTTCTCCCTTTCTGCGGAGAGTAAGATGATAGGTGAA 
GCGGATATGTCCTGAGAGGGAGCTGGGCC I I I C I I GATTGGTAATAGCAAGCTGCAGAAG 

76045 AATAGCAAGCTGCAGAAGGCAGTGCCGGGGTGGGGTGCTGGGGGAAGGGAGGTCTCCTCC 
GAGGCTGCCCACCTCCTGTAAACACAAGTTACCITGGAATCAGCCTTTGCTGCCTGGGCC 
TGAAGTCATCTTCCAAACCTGAAATCTGCTGAGGGATCATGTGGTTGGAACTCCCGTCAT 
GTTTCTTGGACTGTGGCCAGGAAGGGCTACAGGAGGATGTGGGCAGTGGTCATGGGGGAC 
TATCCTTAGGTGAGG I I ICI I I GAAAAAAATCCACCATTCAACTAAAAATAGTTTTAGGA 
[A,G] 

CAAGGGACAAGTAGACAAGAAAAGCAAAGCTACCAGGAAGGAAAAACAGAAGAGAGAGAA 
GCAGCCAGGGTTGGTTTTAGAAGTCCTAAGCCAGAGAAAAGAAAGCTTAGGAGGGAGGGC 
CTAAGGGCTGGAACTGCGATGAGGACACTGGAAGGGGCGTGGGGGAGTGGCGATGGGGTG 
CAGTGAGATAAGGATGGGGACAGGGTGAGGAGCTACTGCCCAAGAGGTGTGGAAACAGCC 
TGGTCCGCCTTGCCAGAGTTTGCAGACGCAGCTCCTCTGTGTCTAGGACATCGTCCTGCA 

79973 ACTTCAGGCCTTCTTTG AGTGTTATCTCCTTAATATATTGG AAGGTCCCTATG ATATCAC 

TTCCCAAAGGCCACTTTAGGCTGAATTCCTTTAACCTTTGTTTCCAAATACAAATTCTGT 
TTCTCCCACTTTTTAGTCATTTTGTTTTATTG1TCTCTTCTGAATCTGTT 
TGCTTATCTCTGAGAATATGGAGGCAGCATAGACTCAAA I I I C I I GCATGGTGGGAGAAT 
CATGTTCCATTCATTCATTGGAGGTTTACTGAGTGTCCATGAGGTGTGATATTAAAGGAT 
[G,A] 

CAAAAATGAATTAGTCATAGACCCTGTCCTCAAAGAGCTTACGGTCCTTTAGGAAATAAG 
A C AAGTATATAAAC AG CTG CAG ATCAAGTATACAAATAAAGTG CTCTAAGTGTTTG AAG G 
AGGTGAGATTAACTCCAGCTGTGGGGATCCAGGATGATTTTGTGAGTGGGCCTTGAAAGA 
TGATGGGTCTGGATGATTGGAGAGGAAGGGCATACGGGCAAAGGAATGTCGTGTTCCTCT 
CAATACCATCTAAGATCATTCTGAAGTTGTCTTC I I I I I AATCATAGTGGCTCAGTCTTA 

83181 ATGTATGTGTATTGTGCCCCTAGTCAGAGTTTTAAGTGCACTTCGTGTTAGTTAGCAACT 
TCAATTTAAAGTGCTGTGTGATCCTCTGGGCTTTGAAGATATGAATTCTCTATACATACA 
CATGAGCTAAAATTACCGCATAATCACGAGGATGTGTGCACTTTGAATAATTGAATGCAC 
TAAACATCTCCAGCTGATGGATTGATGATGTGGGAGAGTAGTGCAAAACCCACGTGACAC 
GGAAGATTACAAGACATTTTGAAATGATAGCTTAATTTTCAACC I I I I I I M I I II I I I I 
E-,T] 

GCTTGTGCTTTCTC I I I I AGCGATTATGGGCAAGTAAAAGAGATTTGATTATCTTGTTTC 
CTGTGAAAGGTTATTTAGCCCCTCAATCCCATTAAGTCACTTTCTGCACTCCAGTGCACT 
GAGCCTTCAGGTTATCAGCCCAGGAAGAGTAAATGGAGACACCAGGATCAGATTTCTAAA 
GGAAGGAAAGTTAAAGCAAAGTTTGAAAGGGGTGACAGCCAACAGCCATGAGCTTAGGAA 
CTCAATTAGACTAAAAAGACAACTATCTCATTATCTCGTTCCGAAATGATCTAAATACAT 

84935 GAAGTTTTAT TTTC CGTACTTTTCAGATAAGCCCCTTGCAATGTCAALI I I I GGAGGAAC 

TGGCTTGTAGCTTTAGTGTAGTGAGCATTCAGGAGTAGCAAGCACTTAACAAACACAGGC 
ACTTGGGAAAATGCG AACT GCCAGTATTCTGAGCTAACCAAATACCTGATTTGGACATCA 
AAGGTAGTATATATATTTTCAAATAG I I I I I I I I I I CTCAATTTGCAAAAGTTGCACCTT 
AGAGTCTATATAAATAGATTGCCAGCCAGGCCCCGAGGCTCATGCCTATGATCCCAGCAC 
[C,T] 
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TTGGGAGGCTGAGGCAGGTGGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAAC 
GTGGTGAAACCTTGTCTCCATTAAAAATACAAAAATTAGCCAGATGTGGTGATGCTTGCC 
TGTAGTCCCAGCTACTGCGAGGCTGAGGCAGGAGAATCACTTGAACCCAGGAAGTGGAGG 
CTGCAGTGAGCCGAGATCACCCCACTGCACTCCAGCCTGGGCCACAGAGTGAGACTCAGT 
CTCAAAAAAAG AAAAAAAAAAAAAAAAAAG ATTG CCTAATGTATCTAACAAAATTG G AAA 

87476 AG ATATCGG CTTCCTGG CTCTTCTAAG ATCATTGTCTTGTCTTCAAAG AG CAG GG AAGTT 

TAAAATTATAG CTG ATCCTCAG AG CATTTTATG AAATACCTAAGTTTGTATTTTG ACAAT 
ATACATTTAG AATTACAATTG GTG ATATAATATGGTATTTC CCAAG CAAAATATTCTCTA 
GAGCAGAGC I I I I ACTGTATAATTATTTTAAACCTG CTAGTTATAAGG AC AG AATG AACT 
TTAGCTGCATTCTGTGAAGTGGAGGGCCTTACCCTTCATAAAATTATCAAAGATACTAAT 
[C,G] 

GAAATGGGCTCAGATGGTGGTGCTCCATTAACATTATTATTATTTCTATTGTTGTTATTT 
TAI I I I CI I AAGCAGCTGGAAGGTAGGGAGATAACCTTGCCTTTCTGATCTACAGTGACC 
TTCCACAAAAAATTGCTACCTGTAGACATAAATTGGTCACAATAGAAAATCAATTAAATG 
CAGCATGTCAAGCTTGTGGCCTGCGCCAGCAATTGAAGAAGGGAGAAGGCAGATTTTGCC 
TCTGCTGGAGTAAGCTCATGATGTCCCTAGGCCTTGGTACGAATGTAAGGAGATAAAACT 
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